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HOST VARIABILITY IN DILUTION EXPERIMENTS 
PETER ARMITAGE 


Statistical Research Unit of the Medical Research Council 
London School of Hygiene and Tropical Medicine* 
London, England 


1. Introduction 


Moran [1954a, b] and Armitage and Spicer [1956] have discussed 
the theory of experiments in which doses containing different numbers 
of infectious particles are administered to groups of host organisms, 
and the proportion of hosts infected at each dose is recorded. Suppose 
that, for any one host, a proportion p of the particles will initiate a fatal 
infection on any one occasion, and let p be a random variable with 
density function f(p). The problems commonly discussed are those of 
estimating certain characteristics of f(p), and of testing the null hy- 
pothesis, Hy , that p is constant. 

Armitage and Spicer [1956] stated (pp. 408, 413) that a maximum 
likelihood approach to the estimation problem was inappropriate, in 
particular because the expected values, on H, , of the second derivatives 
of the likelihood function do not all converge. This view appears now 
to be incorrect, and the purpose of this paper is to discuss the maximum 
likelihood approach to this problem, and to describe tests of Hy appro- 
priate in various circumstances. 


2. Type III distribution of p 


Suppose that, at the zth dilution, n; hosts are exposed of which r; 
become infected. It is convenient analytically to assume that p follows 
a Type III distribution. This assumption is theoretically objectionable 
in that the Type III distribution is not restricted to values of p in 
(0, 1), but if the mean value of p is small, the anomaly may not be 
practically important. As in the previous paper, then, let 
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*This work was done during visits to the Department of Statistics, University of California, 
Berkeley, California, and to the National Institutes of Health, Bethesda, Maryland. 
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corresponding to a Type III distribution with a mean of j, and for 
which the ratio of the standard deviation to the mean is V’””. Then, as 
shown previously, the expected proportion of uninfected hosts, at a 
dose of \,; particles, is 


where y; = A,p. 
Write y; = yz; , where 7 is the value of y,; at zero dilution, and z; 
is a dilution factor. Then 
log P; = —V™ log (1 + 27V), 


and the log likelihood is 
L = log P; + Sor; log — P) + D log (™). 


(Here and throughout the paper the summations are taken over all 
values of 7 used in the experiment.) Moran [1954a] gives the likelihood 
equations, using (in our notation) k = yV and 1 = V~’ as parameters. 
It is convenient for our purposes to work directly in terms of V and y. 
Writing Q; = 1 — P; , we have 


oP; 
n,Q; — || log P; 


= oL nQ: — 


The expectations of the second derivatives are 


by = = | ’ 
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On IT, , i.e. as V — 0, we find 


4 
iP; 
Ly? Ly = 


For constant n; = n, and for 2-fold dilutions over a wide range, the 
summations in (2) are close to the corresponding integrals (cf. similar 
situations in Moran [1954a, b] and Armitage & Spicer [1956]). Evaluation 
of the integrals gives: 


—2.34219n 
L°, 1.73420n/y (3) 
and L°, —2.37313n/y’. 


Maximum likelihood (m.l.) estimates, V and 7, can be obtained by 
standard procedures. (See, for instance, Rao [1952], Sec. 4¢c.2.) For 
asymptotic confidence limits the variances of V and 7 should presumably 
be expressed in terms of the L;; , evaluated at V, 7 and the estimated 
values of P;. Asa test of H, we should estimate variances in terms of 
the L°; ; thus V[—L?, + (L!,)?/L,]'” may be taken as a normal deviate 
N(0, 1). 

However, an asymptotically equivalent test of H) may be carried 
out without evaluation of V and 7. Let % be the m.]. estimator of 
subject to the condition V = 0. (Under this condition P; = e 7’, 
and a number of procedures are available for obtaining 7» : cf. Finney 
[1952, §21.5] and Peto [1953].) Denote by (L;)y-,,- the value of dL/aV 
(t = 1) or 0L/dy (i = 2), evaluated at V = V’,y = y’. Then, asymp- 
totically, we have with probability 1, 


(Li)o,y + — y) Li 


and 


0= (Le)o,4. (Le)o, +@- 7) Le 


Hence, 


¢= ~ (Li)o.y = (Li2/L22)(Lz)o, 
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and is consequently distributed asymptotically normally with zero 
mean and variance 


+ 


That a test based on (Z,)o,;, is asymptotically equivalent to one based 
on V follows from the fact that 


To show this, we can expand (L;)y,;, in terms both of » and of 7 . 
Thus (the equivalences always being understood to hold with probability 
l,asn — &), 


(Li) (L1)o,4. + ~O+ (Fo 9) Li. 
Hence 
Similarly, 
(L.)¢.5, ~ 0+ VLi, ~ 0+ Go — (6) 


Elimination of (fo — 7) from (5) and (6) yields (4). 
Now, from (1) 


= niQ; (Fox)? 
(L1)0.4. >| | 2 


and, from (3), 
var(¢) = + (Li2)’/L2 
1.0749n, when n; =n. 


The statistic ¢ is readily calculated in terms of the m.1. estimate 7) and 
the Q; (the estimates of Q; based on 4,). For the use of essentially 
the same method in other problems, see Bailey [1956] and Neyman 
[1955]. 

Suppose now that y/ is a consistent, but inefficient estimate of y, 
calculated on the assumption that H, is true, and that yj has efficiency 
greater than zero. Then, by a similar argument to that used earlier 
we find that, with probability 1, 


= — 40° (7) 


Hence, ¢’ may be used as an asymptotically normal deviate with zero 
mean and variance 1.0749n as before. 
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From (1), (3), and (7), we find 


n.Qi — i iG 


where the Q/ are estimates of the Q; obtained from vf . 

Now, in the problem of estimating the parameter in an exponential 
response curve (commonly referred to as the “dilution method’’), a 
very simple estimate may be obtained from >> r; or > (n — r;) [Fisher, 
1922; Fisher and Yates, 1953]. This estimate has an efficiency of 
87.7%, but is not strictly consistent, owing to a small bias depending 
on the position of the dilutions. This effect, although clearly affecting 
the asymptotic properties of the test, is perhaps negligible in samples 
of the size usually employed, and it would seem reasonable to take 
i‘, to be the Fisher estimate. Tables for calculation of 7) are available 
in Fisher and Yates [1953]. 

As an example, consider the data of Parker, quoted by Moran 
[1954a] and Armitage and Spicer [1956]. As before, the original data 
have been modified so as to give n; = 40 at all doses. At dilutions 
1, 2-', --- , 27%, the numbers of infections out of 40 are, respectively, 
40, 35, 32, 25, 19, 10, 8, 5, and 4. Anm.l. solution by the log-log method 
(Finney, [1952], §21.5) gives 


E ri Gor)” _ 98 3, 


and S.E. (¢) = 
The inefficient method gives 


ty \2 


= 21.9, ¢’ = 56.8 — 16.0 = 40.8, 


and 8.E. (¢’) = 

The discrepancy between the statistics ¢ and ¢’ presumably arises 
because H, is strongly contradicted by both. The asymptotic equiva- 
lence given by (7) is, in these circumstances, hardly applicable. 

As an example showing greater consistency with Hy , consider the 
following data, obtained by artificially modifying the responses of the 
previous example. At dilutions 1, 2~*, --- , 2~*, we shall suppose the 
numbers of infections out of 40 to be 40, 39, 33, 25, 19, 10, 8, 4, 2. 
The m.]. solution gives 


4.= 849, o@=9.1, and S.E.@) = 6.6. 
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The inefficient method gives 


%~=891, > ES = 118, 


= 5.0, ¢’ = 11.8 — 3.7 = 8.1, 


and 8.E. (¢’) = 6.6. 
As we should expect, ¢ and ¢’ are in closer agreement in this example 
than in the previous one. 

Armitage and Spicer discussed the asymptotic relative efficiencies of 
two quick tests of H,. For a statistic y, defining 


Rwy) = {5% wert 


they found R? = 0.4062n when y is Moran’s statistic 7, and R’? = 
0.2882n when y is the Spearman-Karber estimate of the variance of a 
tolerance distribution. For the m.l. estimate, V, we have 


RV) = 1/[varV]y, = —L°, + = 1.0749n. 


Clearly, the efficiency of either of the two quick methods is low. The 
basic reason for this appears to be that a small degree of variability in p 
causes a flattening of the response curve (i.e., that of P; or Q; against 
x; or log x,;) primarily at low dilutions, where the survival rate is low 
(cf. Armitage and Spicer, particularly Figs. 1 and 2). The two quick 
tests take no account of this, whereas the maximum likelihood test is 
particularly sensitive to departures from the exponential shape in this 
particular region. That this is so may be seen from the fact that ¢ 
is a weighted sum of deviations of observed from expected frequencies, 
the weights increasing with the estimated density of particles. It 
should also be noted that, in the above discussion, var (¢) has been 
approximated in terms of the integrals corresponding to the summations 
in (2). The contributions to these summations come largely from 
results at low dilutions, where survival rates are low, and if the choice 
of dilutions was such that low survival rates were not represented, 
var (¢) would be appreciably increased. 


3. General considerations 


To what extent is it reasonable to test H, , rather than to estimate 
the parameters of f(p)? Let us distinguish between two different 
situations, both of which require p to assume a value x < 1 for a par- 
ticular host; (i) the particles are of two kinds, a proportion 7 being 
invariably infective when inoculated into a host organism, the remaining 
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proportion 1 — z being inert, i.e. never infective; (ii) some at least of 
the particles are neither invariably infective nor inert. Situation (i) 
includes the case in which x = 1, as in the dilution method of bacterial 
counting, where the medium (i.e. host) is such as to ensure the multipli- 
cation of any bacterium (i.e. particle) which may be present. In 
situation (ii) the outcome of the inoculation of a particular particle 
into a particular host is, at least sometimes, a matter of chance. It 
may be, for instance, that infection only occurs when an inoculated 
particle settles on a susceptible site in the host, and that this is not a 
certain event. 

Now, in situation (i) it is reasonable that H, should be true, i.e. 
that x should be constant from host to host, since 7 is a characteristic 
of the population of particles. In situation (ii), on the other hand, Hy 
is inherently implausible, since the probability of infection by a ran- 
domly chosen particle is a characteristic of the host and will presumably 
vary to some extent from host to host as does any other continuous 
measurement. If we knew that situation (ii) was applicable, then, it 
would be more reasonable to make estimates of the parameters of 
f(p) than to test H, . Frequently we shall be unable a priori to dis- 
tinguish between (i) and (ii), and we can regard a test of H, as effectively 
a test of the hypothesis (i), in that a significant contradiction of H, 
will indicate that (ii) holds. 

Slight departures from situation (i) may give rise to distributions 
f(p) which are not well represented by Type III curves. Suppose, as 
before, that a proportion 1 — = of particles are inert, but that a randomly 
chosen particle from the infective proportion 7 has a probability p’ of 
infecting a particular host. Then p = zp’, where 7 is constant between 
hosts. It would be reasonable to choose, for the distribution of p’, 
one with a mean slightly below unity, and with negative skewness. 
The negatively skew truncated exponential considered by Armitage 
and Spicer seems the most promising candidate, and we now consider 
this distribution. 


4. Truncated exponential distribution of p 
Let the density function for p’ be 


g(p’) = 


This distribution has mean 
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and variance 


— 1)’ 


The proportion of uninfected hosts is 


1 


Write K = k™*. For small K, the mean and variance of p are respec- 
tively — K) + and + Since we are con- 
sidering the effect of small values of K, we may define y; = \,7(1 — K), 
(which is asymptotically equivalent to the previous definition y, = \;p), 
and write y; = y2 . 

We have, as K > 0, P; = e'* (1 — A;rK)™ + 0(K") for any 
j > 0, and, after some reduction, 


P; = + + | (8) 
and 
2 2 V 2 
log = + AK’) = + + O(V), 
where V = K’. Again, this is asymptotically equivalent to the previous 


definition of V, i.e. the ratio of the variance to the square of the mean. 
On H, , 


oL nQ; (yx;)° 
ay | 2° 
0 


These are exactly the same as the previous expressions for (L,)o,, and 
° 

Similarly, the expected values, on Ho, of the second derivatives are 
exactly as given by (2). It is clear, therefore, that the asymptotic tests 
based on ¢ and ¢’, which were described above, are valid also for the 
present model. Furthermore, the expansion (8) for P; may be derived 
immediately from the general expansion for P; in terms of moments 
of f(p) [Armitage and Spicer’s equation (3)], on the assumptions that 
ua(p)/p = K* + o({K”) and u,(p)/p' = o(K*),j > 2. It would appear 
that the tests based on ¢ and ¢’ have a validity unrestricted by strong 
assumptions about the particular functional form of f(p). 
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The stimulus to re-examine the maximum likelihood solution to this 
problem arose from some seminars given by Professor J. Neyman. 
The use of efficient test criteria like ¢’, based on inefficient statistics is, 
I understand, the subject of some unpublished investigations by Pro- 
fessor Neyman and Professor L. LeCam. Conversations with Mr. 
Nathan Mantel have helped to clarify the views expressed in §3. 
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RESTRICTED SELECTION INDICES’ 


Oscark KEMPTHORNE AND ARNE W. Norpskoa 


Towa State College 
Ames, Iowa, U.S.A. 


Introduction 


Smith [1936] developed a discriminant function for the selection of 
varieties according to their genotypic values in the presence of errors of 
observations. Smith’s argument is applicable to the selection of varieties 
which are self-perpetuating and, essentially, involves no genetic theory. 
Hazel [1943] extended this technique to the case when one wishes to 
select individuals whose progeny will be of superior merit by assuming 
that each individual has a true unknown “breeding value” (half of 
which is expressed in its offspring) and the correlations of “breeding 
values” with observed phenotypic expressions are known. It is not 
our purpose to expound the assumptions necessary for utility of such 
a genetic index, but it appears that predictions of increase in population 
mean from selection and mating of selected individuals will be correct 
apart from sampling and estimation error if the following conditions 
hold: 

(1) the phenotypic value (P;) defined as the observed value of 
attribute z for an individual shall be made up additively of two 
parts, a genotypic value (G,) defined as the average of the 
phenotypic values possible over a population of environments 
and an environmental contribution (Z,), ie., P; = G; + E; . 
It is possible to permit interactions of genotype and environ- 
ment provided genotypes and environments are associated 
entirely at random and any such interaction then is incorporated 
into £; . 

(2) the genotypic value G, is composed entirely of additive effects of 
genes and is then, appropriately, also called the breeding value. 

(3) with attributes denoted by i = 1, 2, --- , m, the genotypic 
economic value of an individual is H = >. a,G; , where the a; 


1Journal Paper No. J-3462 of the Iowa Agricultural and Home Economics Experiment Station. 
Ames, Iowa. Projects 890 and 1039. 
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are known economic constants, and G; ,7 = 1, 2, --- , m, are 
the genotypic values of the individual for the m attributes. 
(4) the quantities P; and H are such that the regression of H on 
any linear function of the P, is linear. 
Condition (2) above will rarely hold and for development of an index 
it is assumed that G,; represents the breeding value which is the least 
squares prediction of G; on the basis of a model containing only additive 
effects of genes, and that deviations of actual G; from predicted values 
behave completely as random variables. With this assumption the 
offspring of a mating will have a genotypic value equal to the average 
of the breeding values of the parents. The symbol G; is used in this 
paper for both genotypic value and breeding value, but it should be 
borne in mind that this usage is based on an approximation. 
Under these circumstances, if selection based on a linear function 
I = >>; b,P; is made, the gain in H is equal to 8y,A,; where A, is the 
difference in mean J resulting from the selection, i.e., the difference 
of the mean of J after selection and the mean before selection, which is 
the selection differential. This can also be written as 


On Ar 
= pur — Ar = purtu\— ]° 


The above expression shows that the genetic change due to selection is 
proportional to pz; (not pz; , a8 might at first be thought) and to A;/o, 
which is the selection differential in the index in standard deviation 
units. If J is in addition normally distributed, the }b,’s are known 
without error and truncation selection with a saving of the top p per cent 
is used, then the relationship of p to A;/c; is given by the two equations 


dx 
k V2r 


which defines k as the normit (or probit minus 5) corresponding to p and 


Cov (H, I 


Or Pde V2rP 


where z is the ordinate of the normal distribution above which there is 
the fraction p of the distribution. The choice of selection index then 
reduces to finding that linear function J = )_, b,P; which correlates 
best with the index H. As shown by Smith and Hazel, the equations 
determining the b’s are Pb = Ga where P is the matrix of phenotypic 
variances and covariances, the zjth element P;; being the covariance 
of P; and P; , and G is the matrix of genotypic variances and covariances 
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so that G;,; is the covariance of G; and G; and a is the column array of 
economic constants or weights. 

With the stated assumptions, the index >> b;P,; obtained maximizes 
gain in the genotypic economic value H when errors of estimation are 
ignored. While use of J will result in best progress in H, the means of 
the G; will change in either a positive or negative direction, so that a 
breeder may well be interested in increasing H as much as possible 
with a restriction that some G; or some linear functions of the G; will 
not change. For example, a poultry breeder may feel that he should 
keep mean egg size at a constant intermediate level while using an 
index to maximize progress in genetic economic value based on egg 
weight, body weight, and production. It was in fact such a situation 
which led to the present note. 


The Mathematical Solution 
We shall suppose that the breeder wishes to maximize progress in 
H given by 
H = 4,6, + a.G, + + anGn , 


but he wishes no change in r(< m) quantities V; given by 


i=l i=l 
in which the coefficients a; and c;; ,7 = 1, 2, --- ,r;j7 = 1,2, ---,n 
are given. A common circumstance would be that in which there is one 
linear function V, and only one coefficient c;; is unity, the others being 
zero. 

We are given P, , P, , --- , P,, , and are to determine the linear 
function J = z: b;P; so that the correlation of J with H is a maximum 
subject to the conditions 


Cov(J, V.) = 0, k= 1,2, --- 
Now 


For brevity of manipulation it is convenient to utilize matrix notation 
as follows: 


i 
| 
| 


H 
1 
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a, by | 
a, b 

a=| b=| 
_| 

Cy Ci2 Ci, 

C22 Cor 

LCm2_] LCmr— 


Also, since the correlation p;, is unaffected by multiplication of the 
b’s by a constant, we take 


Then 


(a’Gb)’ 
fan = (a’Ga) 


Cov(I, = b’Gc, = ciGb, k=1,2,--- 


and we have to maximize p;,, subject to the r conditions 
Cov(/, = 0 


and the condition b’ Pb = 1. 
Introducing (r + 1) undetermined Lagrange multipliers, 2, and », 
we have to maximize without restriction the quantity 


(a’Gb)* b)? 
(a’Ga) 


with regard to the b’s. 
Differentiating with regard to b; we have 


—2 A(ckGb) — v(b’/Pb — 1) 


9 Gb) 
Ga) 


(a’G); — 2 (iG); — 2(b’P); = 


Arranging these equations in matrix form we have 


(a’Ga) (a’G) = > A(ciG) + vb’P. 


| 
j 
| 
. 
‘ 
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So 
or 
1 (a ‘Gb) 
b= Ga — UMP Ge, 
But 
ciGb = 0 for l=1,2,---,r, 
so 
o = — . 
(a’Ga) 
If we use the matrices C = (c¢, ,c. , --- , c,) so that C’ isr X mand C 
is m x rand = (A, pads 
then 
(a’Ga) 
so that 
= (C’GP“GC)" C’'GP”Ga. 
Also the equation defining b can be written 
= 1 (a’Gb) 1 
(a’Gb) _ "YD-1 -1 
{1 — P 'GC(C’GP 


We may now relax the condition that b’Pb equals unity and drop 
the term (a’Gb)/v(a’Ga) which is merely a constant multiplier to all 
the b’s to give the index in final form 


b = [I — P™'Ga. 
We note that if there were no restrictions the index would be b = P~'Ga, 


so that the effect of the restrictions is to multiply this unrestricted 
index by a matrix. Also we find | 


2 _ — "Ga, 
Pru >= a ’Ga 


We verify that the conditions c/Gb = 0 or matrix-wise C’GB = 0 
are satisfied since 


C’Gb = C’G[I — 


{ 
' 
q 
| 
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= [C’G — (C’GP™'GC)(C’GP GC) 
= (C’G — C’G)P'Ga 
= 0. 


The results of using the index are as follows. The change in G, 
resulting from a change A; in the index is equal to 


WAG; /OV I 
CovG, Pe Cov(G ) Ar 
0; 
This is equal to 
biG 


which can be worked out in any particular case. 
Examples 

The methods are illustrated below for a situation with poultry. The 
estimates of genetic parameters used are given in Tables 1 and 2. 


TABLE 1 
VARIANCES AND HERITABILITIES 


Variance Heritability 
BW Mature body wt. (Ibs.) .34 .45 
EW Mature egg wt. (0z/doz.) 2.4 .50 
SM Age Ist egg (days) 13. .40 
BS __ Blood spots (fraction) 50 x 10-5 15 
EP Egg production (to 72 weeks) 512 .20 
TABLE 2 
PHENOTYPIC AND GENOTYPIC CORRELATIONS 
BW EW SM BS EP 

BW 0 0 — .05 

EW .30 0 0 — .05 

SM 0 0 —.02 —.40 

BS 0 0 — .07 + .03 

EP .15 15 — .50 .08 


Phenotypic correlations are above the diagonal and genetic correlations below. 


| 
{ 
| 
Pais 
| 
} 
{ 
{ 
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To ease the purely computational problem, the variables were 
coded to have variances of approximately the same order of magnitude 
as follows: 


Attribute Equal to 


10 BW 
3 EW 
SM 
EP/3 
10° BS 


or WN 


This leads to matrices P and G which are of course symmetric: 


34 6.77504 0 0 — 2.19881 
6.77504 21.6 0 0 — 1.75256 
P= 0 0 13 — .50991 —10.87709 
0 0 — .50991 50 1.59986 


_— 2.19881 — 1.75256 —10.87706 1.59986 56.88 


| 15.30 3.855027 0 0 1.97825 | 
3.85503 10.80 0 0 1.66260 
G=| 0 0 5.20 43716 — 3.84567 
0 0 — 43716 7.50 .73894 

_ 1.97825 1.66260 — 3.84567 .73894 11.376 


The matrices P and G are the primary ingredients and the only require- 
ment for obtaining the unrestricted index is to determine the economic 
weights for attributes 1 to 5. 

The economic weights were chosen as indicated in Table 3. 


The entry in the last column is obtained by noting, for example, that 
a change of 1 unit in 3 EW is equal to a change of } unit in EW which 
is worth 21.6/3 = 7.2. Alternatively, the index of value is 


—25.0 BW + 21.6EW +0SM —7.9BS + 3.6 EW 
= —2.50 (lOBW) + 7.20 (LW) + 0 SM 
— 0.0079 (10°BS) + 10.80 (EW/3). 


| 

| 

4 
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TABLE 3 
Economic WEIGHTINGS 
Value of unit 

Trait Unit Real value Coded change in 

(cents) trait coded trait 
BW 1 Ib. —25.0 a = 10BW —2.50 
EW 1 oz/doz. 21.6 tz = 3 EW 7.20 
SM 1 day 0 z; = SM 0 
BS 1% — 7.9 zs = 10° BS — .0079 
EP 1 egg 3.6 ts = EP/3 10.80 


The following calculations are presented in terms of coded traits and 
the index and changes may be decoded by substituting 10BW for z, , 
3EW for x, , and so on. 


The unrestricted selection index has coefficients which are the 
elements of P~'Ga and is 


—0.39592, + 4.28192, — 1.41112, + 0.07592, + 2.12812; . 


Next we compute an index designed to lead to no change in attribute 1. 
For this case 


C’G = (15.30 3.85503 0 0 1.97825), 


i.e., the matrix consisting of the first row of G. The matrix C’GP™'GC 
has dimensions 1 X 1 and is a scalar equal to 71.09014 and insertion 
of the appropriate matrices in the expression for the coefficients of the 
index leads to the index 


— 1.31432, + 4.19132, — 1.52062; + 0.11022, + 1.99702; . 


As a second example, an index maximizing the gain in economic value 
but keeping x, and z, constant was constructed. For this 


= 15.30 3.85503 0 0 
3.85503 10.80 0 0 1.66261 
consists of the first two rows of G and 
= 7.10901 
2.29706 5.54508 
This leads to the index 
—0.16852, — 0.20072. — 1.60077; + 0.08612, + 1.69442; . 


| 
Wer. 
| 
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As a final example an index was computed, solely for illustrative 
purposes, keeping z, + 0.1z, constant. For this 


C’G = (15.68550 4.93503 0 O 2.14451) 
and C’GP™'GC = 7.62388. This leads to the index 
— 1.53862, + 4.04217, — 1.55832, + 0.08017, + 1.95192; . 


The different cases are summarized in Table 4. These examples indicate 
the general effects of restriction of index. For instance, with increasing 
restriction the correlation of index with economic value decreases. 
The weights attached to some attributes are essentially unchanged with 
the imposition of restrictions while others change because of the par- 
ticular correlational structure. 

The important question of connection between the economic weights 
and the restrictions has been raised by a referee. One can imagine 
taking account of a desire to maintain egg weight, for example, at a 
nearly constant value of 24 by including in the total economic value 


TABLE 4 
SuMMARY OF EXAMPLES 


Unrestricted constant , constant +0.122constant 

b* ae b A b A b A 
|—0.39 0.55 |-1.31 0 |-0.17 0 |—1.54 —0.16 
Xe 4.28 1.82 4.19 1.68] —0.20 0 4.04 +1.60 
zs; |—1.41 —0.59 |—1.52 —0.31)—1.60 —0.92)—1.56 —0.61 
x 0.08 0.10 0.11 0.11) 0.09 0.16} 0.08 0.11 
Xs 2.13 1.36 2.00 1.28] 1.69 1.54) 1.95 1.25 
p . 1873 . 1832 .1174 . 1805 


*The coefficient in the index. 


**The expected genetic change in the attribute with selection differential equal 1 standard deviation 
unit of the index. 


function a term such as —k(egg weight — 24)” where k is a prechosen 
number, instead of a term such as 21.6 egg weight, or 6EW or 18EW/3. 
This would permit some change in egg weight but changes in either 
direction from the optimum (24 in the above illustration) would carry 
a negative value and the decrease in value would increase as the square 
of the deviation from the optimum value. This could be handled by 
the ordinary selection index process since the term 


| 
| 
| 
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—k (egg weight — 24)° 
is equal to 


—k (egg weight)? + 48k egg weight — 576. 


To carry out the estimation of index we would need to know the corre- 
lations, phenotypic and genotypic, of egg weight and (egg weight)” 
with each other and with the other attributes, as well as the phenotypic 
and genotypic variances of egg weight and (egg weight)”. It would of 
course be a routine procedure to obtain the necessary estimates. 

The procedure given here permits no genetic change in the chosen 
attribute, attributes, or linear functions of attributes, and may be 
unduly restrictive from some points of view. It is not entirely obvious 
but can be shown that if the restricted index is to keep say attribute 1 
constant, then the weight associated with attribute 1 in the genotypic 
economic value is irrelevant. 


Conclusions 


The purpose of the present note is to give a derivation and examples 
of restricted selection indices. A further development of indices requir- 
ing some genetic changes to be of particular sign will be presented in a 
later paper. 


REFERENCES 


1. Smith, H. Fairfield [1936]. A discriminant function for plant selection. Ann. 
Eugen. Lond. 7, 240-250. 


2. Hazel, L. N. [1943]. The genetic basis for constructing selection indexes. Genetics 
28, 476-490. 


4 
i 
| 
F 
4 
| 
Niet 


EQUILIBRIA IN AUTO-TETRAPLOIDS UNDER NATURAL 
SELECTION FOR A SIMPLIFIED MODEL OF VIABILITIES 


P. A. Parsons 


Department of Genetics and Trinity College 
Cambridge, England 


1. Introduction 


When examining the equilibria of a disomic population, the gene 
frequencies of two alleles can be followed from generation to generation. 
If p is the frequency of gene G, and q the frequency of its recessive 
allele g such that p + q = 1 and if a, h, and b are the viabilities of GG, 
Gg, and gg respectively, we get: 


Genotypes GG Gq 99 Total 
Zygotes 2pq ¢ 1 
Zygotes after 
selection ap" 2hpq bq’ S = ap’ + 2hpq + bi? 
and 


p’ = (ap’ + hpg)/S 
q’ = + bq’)/S 


where p’ and q’ represent the gene frequencies in the next generation. 
Such systems have been examined in detail by Fisher [1930]. 

When tetrasomics are considered, the recurrence relations analogous 
to these will involve not gene frequencies but gametic genotype fre- 
quencies. If there are two alleles, say G and g, there are three possible 
gametic genotypes GG, Gg, and gg, whose respective frequencies may 
be denoted by wu, v, and w respectively such that 


utov+t+we=l. 


If the five possible zygotic genotypes, which can be denoted for brevity 
as G, (GGGG when written in full), G39, Gag. , Gg, , and g, , have viabilities 
a, b, c, d, and e respectively, the frequencies of the five zygotes before 
and after selection are: 


20 


a. 
| 
| 
| 
i 
| 
< 
| 


EQUILIBRIA IN AUTO-TETRAPLOIDS 21 
Before selection After selection 

u? au? 

Gsg 2uv 2buv 

v? + 2uw c(v? + 2uw) 

2ow 2dvw 

Ys w? ew? 

Total 1 S 


As each gamete is a gene pair taken at random from the zygote, the 
gametic proportions are: 


G, : au” + buv + icv” + 2uw) 
Gg: buv + 3c(v? + 2uw) + dow 
go: + 2uw) + dew + ew" 


and the proportions wu’, v’, and w’ in the next generation can be found 
from 


Su’ = au? + bw + tc(v” + 2uw) 
Sv’ = buv + 3c(v? + 2uw) + dow (1) 
Sw’ = + Quw) + dow + eu’. 


The determination of equilibrium points for five different viabilities is 
difficult, but it is of interest to consider a symmetrical case where 
a = e,b = d, andc = 1, thus reducing the number of viability factors 
to be considered. Thus we now have 


Su’ = au? + bu + 20 + Quw) 
Sv’ = buv + 3(v? + 2uw) + bow (2) 
Sw’ = 4? + Quw) + bow + aw’. 

2. Trivial Equilibria 


Firstly, the trivial equilibria obtained by putting u = 0, v = 0, 
and w = 0 in turn will be examined. In (2) 


if w=0, then w=1, 


e 
| 


=Q@0, then u=1 or 
and 
if w=0, then wu 


Il 


og 
| 
| 
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Hence there are two trivial equilibria in which the population is all 
G, or g, and by symmetry these will both be stable or unstable. 

If, initially, the population is nearly all G, and has small values for 
u and v, w being 1 — u — »v, then neglecting second order terms 


S=a-+t (2 — + (2b — 2av 
and, to this approximation, from (2) we get 
aw’ = u/3 
av’ = gu + bo 


which gives a linear transformation with latent roots of 3a and b/a. 
For equilibrium, both latent roots need to be less than unity. Hence 
the necessary and sufficient conditions for the trivial equilibria to be 
stable are a > 3, and b < a. 


(3) 


3. Non-trivial Equilibria 
The equilibria that remain are characterized by having none of 
u, v, and w equal to zero. From (2) we can therefore write 


u au’ + bw + 30° + 2uw) 


w aw’ + bw + 30" + Qu) 


which yields the two solutions, 
either 


u=w (4) 
or 
+ 2uw = 6auw, (5) 


the first solution being symmetric as u = w = (1 — v)/2 and the second 
asymmetric. 


3.1 The Symmetric Case 
Putting u = w = (1 — v)/2 in the relation 
Sv’ = + + Quw) + bow, 


which has already been given (1), gives an equation inv. At equilibrium 
when v’ = », this equation becomes 


fv) = (9 + 3a — 12b) + v°(—12 — 6a + 18d) 
+ vo(7 + 3a — 6b) —-2=0. 


In Table 1, equilibrium values of v for a and b between 0 and 256 are 
given, an equilibrium point always being in existence. 


(6) 
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Examination of this table shows that fora < 1,3 <v < 3. In 
(6) if v = 4, f(v) = —3 + 3a, and f(?) = 2a + 4b which is always > 0. 
When f(3)-f(?) < 0, the graph of (6) must increase from a negative 
value at v = 3 toa positive at v = 3. This condition holds for a < 1. 
Hence, for a < 1 there are one or three roots between 3 and 3. In 
fact, only one root for any situation has been found. Similarly for 
a > 1, it can be shown that there are one or three roots for 0 < v < 3, 
only one of which has been found. 


3.2 The Asymmetric Case 
Asymmetric equilibria are characterized by the equation (5) 
v? + 2uw = 6auw 
in which u ~ w. Rearranging gives 
uw = v’/2(3a — 1). (8) 


For any asymmetric equilibria, stable or unstable, a > }. Substituting 
(5) in (2) gives 


Sv 


bo(u + w) + 4auw 
and 


atu + w) + bv. 
Hence 

av(u + w) + bv? = bo(u + w) + 4aww. 
Rearranging and putting uw = v’/2(3a — 1) gives 


1 


1+(o- 
Using the relation u + w = 1 — v and equation (8), we obtain 
0 < (u — w) = (1 — v) 
Hence 
+ V2/V3a — 1]. 
From (9) 


2a 1 2 
1 > 1 (10) 


Either b > a when 


; a 
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2a 2 
3a — 


b— 3a 


which reduces to b > a > 1; or b < a when 


2a 2 
ys 3a — 1 


which reduces to b < a < 1. 
The conditions for an asymmetric equilibrium to exist are therefore 


a> 1/3 
and 

b<a<l 
or 

b>a>l. 


In Table 2, values of v, u, w are tabulated for a range of values of a and b. 
For any value of v there are always two asymmetric equilibria obtainable 
by interchanging the values for u and w. At the boundary conditions it 
will be noted that the asymmetric equilibria collapse into trivial or 
symmetric equilibria. 


4. Stability of Equilibria 


The procedure for examining the stability of any equilibrium state 
has been applied to genetical situations by Owen [1953]. In place of 
u, v, and w we can put wu + 2,v — x — y, andw + y giving u, v, and 
w their equilibrium values. If x’ represents the next generation, we 
can write 


ute’ wty). 


Neglecting quadratic terms in x and y the relation 


bu ov 


is found and also a similar relation for y’, where u, v, and w are put at 
their equilibrium values after differentiation. ‘These are recurrence 
relations and can be written 


+ 
y’ = byw + dooy. 
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The equations in x and y have solutions of the form 
= + Bide 
Yn = + Bad? 


where \, , Az are the latent roots of the matrix (b;;). The equilibrium 
condition is stable if | \, |, | A. | are both less than unity. 

The matrix obtained is given in Table 3. For the symmetric equi- 
librium, putting u = w gives 


2au + bv 
+ 4buv + v? + Qu? 
2au(1 — 2u) + b(4u — 1)(2u — v) + — — u) 


+ 4buv + + 


By calculating \, and ), for the values of v in Table 1, it can be shown 
that the symmetric equilibria are stable for } < v < 3 anda < 1 and 
unstable outside this range. 

For the asymmetric case, the equilibria are best examined directly 
from the matrix in Table 3 as the algebraic expressions for \, , Az are 
more complex. Examination of the asymmetric equilibria in this manner 
shows that both equilibria are stable for 


b>a>1 
and unstable for 


Combining the information from the trivial, asymmetric, and 
symmetric equilibria gives the complete solution to the problem: 


Equilibria 
a b Trivial Asymmetric Symmetric 
>b, <1 | u 
>43, <b, <1 <1 8 
<i >1 s 
>i <a u 
>a u u 


u = unstable, s = stable. 


7 4 
27 
| | 
: 
| 
jo 


+ magz + + angz+ =g 


BIOMETRICS, MARCH 1959 


— a(t — + — at — + 
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(m — —} 
— a(t — + — — + 
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Thus it can be said: 

1. There is atengs one symmetric equilibrium which is stable for 
4<v< Zanda <1. 

2. There are ae two trivial equilibria stable when a > 3, 

3. Two asymmetric equilibria exist when a > 3 and b > a 

b <a < 1 and are stable for a > 1. 

4. If the asymmetric equilibria are stable, both the symmetric and 
trivial equilibria are unstable. 

5. If the asymmetric equilibria are unstable, both the symmetric and 
trivial equilibria are stable. 

The model examined here therefore allows a maximum of two stable 
non-trivial equilibria. Owen [1953] has constructed an example in 
disomics for which three distinct non-trivial equilibria exist. In his 
example viability differences in the two sexes between homozygotes 
and heterozygotes are in opposite senses, this being manifest by a 
large disparity of gene ratios in the two sexes. More complex situations 
in tetrasomics where there are more than two viability factors, or a 
sex difference in viability, could presumably lead to situations in which 
more than two stable non-trivial equilibria are possible. 


> bor 


5. Summary 


Using a simplified model of viabilities in tetrasomics under random 
mating, it is shown that more than one stable non-trivial equilibrium 
point can be found. 
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THE ANALYSIS OF A NON-REPLICATED EXPERIMENT 
INVOLVING A SINGLE FOUR-COURSE ROTATION OF CROPS 


H. D. Patrerson 


Rothamsted Experimental Station 
Harpenden, Hertsfordshire, England 


Introduction 


The analysis of crop rotation experiments presents a number of 
problems which are not encountered in the analysis of annual experi- 
ments. Some of these problems have been discussed in a general 
account of long-term experiments by Cochran [1939]. More recently 
methods of analysing two types of rotation experiment have been 
considered in greater detail. These two types are: 

(a) experiments comparing the effects of different treatments on the 

crops of a fixed rotation, and 

(b) experiments comparing the effects of different rotations. 

The analysis of experiments of type (b) has been considered by Yates 
[1954]. Some points arising in the analysis of replicated experiments 
of type (a) have been discussed by Patterson [1953]. 

The present paper deals with the analysis of an experiment of 
type (a) in which the treatments were not replicated in each year, and 
which consequently gives rise to a number of problems which were 
only briefly considered by Patterson [1953]. The discussion should 
be of some general interest as non-replication is likely to be a feature 
of many rotation experiments. 

The experiment, which was carried out at Rothamsted from 1930 
to 1954, was designed to test the residual effects of certain organic and 
phosphatic treatments on the crops of a four-course rotation. The 
design, which is of great elegance, is due to R. A. Fisher. Questions 
relating to the design of this and other rotation experiments have been 
discussed by Yates [1949]. 

The four-course rotation experiment raises problems of agricultural 
interpretation as well as problems of statistical analysis. It is intended 
to publish a full account of the agricultural conclusions elsewhere. 
The present paper will be limited to a discussion of the statistical 
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methods adopted in preparing summaries suitable for the purposes of 
the agriculturalists. 

The main statistical problem with which this paper will be concerned 
is the estimation of experimental errors. Since the procedure adopted 
for the Rothamsted experiment is rather complicated, the errors of a 
number of simpler experiments will be considered first. The analysis 
of the Rothamsted experiment will then be demonstrated using the 
yield values of one crop only. 


An experiment with treatments repeated on the same plots year after year 


A simple experiment comparing five levels of a fertilizer on the crops 
of a four-course rotation will be considered first. Each crop is grown 
in each year and is followed by the next crop in the order of the rotation. 
The experiment is thus divided into four parts, usually described as 
series, each of which is at a different stage of the rotation in any year. 
The four series are kept separate in the field on four randomized blocks. 
Each block consists of five plots, one for each level of fertilizer and 
each plot receives the fertilizer allocated to it in every year. 

In any one year there is no replication of plots but over the years 
four-fold replication is provided by the different series. 

The analysis of the yield values of a single crop over 12 years will 
be considered. The yield values form one 5 X 3 table of treatments X 
years for each series. The years are of course different for the four 
series. Thus one series carries the crop in years 1, 5, and 9, another in 
years 2, 6, and 10, and so on. 

The marginal treatment totals of the two way tables are also plot 
totals. Consequently comparisons between treatment means are 
subject to plot differences. Other comparisons, derived from *the 
treatments X years interactions of individual tables do not involve 
plot differences. At least two errors are therefore required. 

The errors of the mean yields over all 12 years will be considered 
first. As in annual experiments the estimate of error is obtained from 
treatments X replicates which in this case is equivalent to treatments 
X series. Since, however, differences between series are also partly 
differences between years, treatments X series may include components 
of year-to-year variations in treatment effects produced by the influence 
of seasonal conditions. It is desirable to eliminate these components 
as far as possible. 

A suitable method is to partition the 12 degrees of freedom for 
treatments X series into components for linear responses X series, 
quadratic responses X series, and so on. Since the relationship between 
yield and level of fertilizer can usually be adequately represented by a 
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parabola, the interactions of series with cubic and quartic responses 
can be expected to be small relative to their errors and can therefore 
themselves be regarded as estimates of error. The partition of degrees 
of freedom is shown in the first part of Table 1. The whole of this part 
of the analysis is based on plot totals and the error term will be referred 
to as plot error. 


TABLE 1 
ANALYSIS OF VARIANCE 
Source d.f. | MS. 
Series (S) (Years) 3 
Treatment 4 
T XS | Linear, quadratic responses X S 6 
Remainder (plot error) 6 M, 
Plot totals | 19 
Years (Y) 8 
T X Y| Linear, quadratic responses X Y | 16 
Remainder (plot X year error) | 16 M2 
Plots X Years | 40 
Total | 59 


If the levels of fertilizer are 0, 1, 2, 3, and 4 units, the estimated 
error variance of the linear response, averaged over the twelve years, 
is M,/120. 

‘The errors of comparisons not involving plot differences can be 
determined by a similar partition of the plots X years degrees of freedom, 
as set out in the second part of Table 1. The error term in this part 
of the analysis will be referred to as plot X year error. The total 
error degrees of freedom (6 d.f. for plot error + 16 d.f. for plot X year) 
constitute the total degrees of freedom for cubic responses X years and 
quartic responses X years. 

As an example of a comparison not involving plot differences, we will 
consider the difference between the average linear responses in the first 
and last four years. This difference provides an estimate of trend in 
the linear responses to fertilizer in the course of the experiment. Since 
only plots X years interactions are involved, an estimate of the error 
variance of the difference is M,/20. 

The estimated variance in this case is not unbiased although in 
most practical circumstances it is likely to be nearly so. An unbiased 
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estimate can be obtained by further subdividing the plots X years 
part of the analysis along the lines suggested by Cochran [1939], thereby 
taking into account possible trends in the plots occurring independently 
of the treatments. Plot trends have, however, usually been found to 
be small, as pointed out by R. A. Fisher in the discussion following 
Cochran’s paper, and will therefore be ignored in the present paper. 

Similar methods of analysis are available for other cases in which 
some treatment comparisons are expected to be reasonably constant 
from year to year. Thus, for example, if the treatments form a single 
or fractional replicate of a factorial system, the errors can usually be 
obtained from years X high order interactions among the treatment 
factors. 

Cycles of treatments are often included in experiments on fixed 
rotations of crops. If the treatment and crop cycles have the same period 
there are no new problems of analysis since each time a plot carries a 
given crop it receives the same treatment. 

The remainder of this paper is concerned with an important class 
of experiments in which the treatment and crop cycles have different 
periods. Experiments of this type are capable of providing very accurate 
information on the residual effects of treatments. 


An experiment on the residual effects of manurial treatments 


As an example, we will consider an experiment comparing the direct 
and residual effects of two manurial treatments on the crops of a four- 
course rotation. 

As before there are four series, one for each phase of the rotation. 
The series are accommodated on four randomized blocks of ten plots 
each. Each manurial treatment is applied to one plot of each block in 
each year. Once a plot has been treated it receives no further treatment 
for five years when the original treatment is applied again, but this 
time to a different crop. After 20 years each plot will have been treated 
whilst growing each crop. Balance of this type, which can always be 
obtained by choosing treatment and crop cycles with numbers of years 
with no common factor, ensures that the effects of persistent differences 
in fertility are minimized. 

There are, therefore, two treatment cycles of five phases each. 
The phases of each cycle will be numbered 0, 1, --- 4 according to the 
number of years since the previous application of manure. 

The scheme for one series (before randomization) is shown in Table 2. 
Since, from the nature of the design, some plots will not have been 
treated until the fifth year, the first four years must be regarded as 
preliminary years and will not be included in the analysis. The 10 
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TABLE 2 
One SERIES OF AN EXPERIMENT ON THE RESIDUAL ErFrects oF Two MANURES 

Year Preliminary l2s8 6 8 
Crop | Ci C2: Cs Ca] Ci C2 Cs Ca Ci C2 Cs G% ... 
Plot 

2 — — —- —- —- 
3 — Mm—|—- —- - 
4 — M— —|— — — — — Mm. 
5 M—- — Mm—- —- — 
6 — — —|M: —- — 
7 — — — —- —- — 
9 — Mz — —|— — M— — — — hh. 
10 — — — — Me — 


* Mi, Mz denote applications of the two manures. 


treatment sequences are, of course, allocated to the 10 plots at random, 
but it is convenient to retain the plot numbers given in Table 2 for 


purposes of analysis. 


The analysis of the yields of crop C, over a complete crop-treatment 
cycle, i.e. twenty years, will be considered. The data for each series 
can conveniently be arranged in two Latin square tables, classified by 
plots, years, and phases, one table for each treatment cycle. One such 


table, for plots 1-5, is shown in Table 3. 


Phase-differences provide estimates of differences between the direct 
and Ist, 2nd, 3rd, and 4th year residual effects of the treatments. The 
average values of these differences over twenty years are subject only 


TABLE 3 
ARRANGEMENT OF YIELDS oF Crop C; , Piots 1-5 
Plot 
Year} 1 2 3 4 5 
Phase 
11/0 12 3 «4 
5 2s 
41 
13 0 4 
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to plot X year error since each phase of a treatment cycle occurs equally 
frequently in the plots assigned to the cycle. On the other hand, 
differences between the two treatment cycles involve plot differences 
and are, therefore, subject to plot error. 

As in the previous example, the two errors can be determined from 
suitably chosen interactions of years with treatment contrasts. Suppose 
that the 9 degrees of freedom between treatment-phase combinations 
are partitioned as follows: 

T. (1 d.f.): difference between the mean yield values over all 
phases of the two treatment cycles. 

T, (2 d.f.): the two contrasts between the mean yield in phase 
0 and the mean yield over the other four phases, 
one contrast for each treatment cycle. 

T., T;, T, (6 df.): contrasts between phases 1-4. These contrasts 

will be denoted collectively by T2-, . 

Since phases 1-4 of each treatment cycle differ only in the number of 
years since the previous application of manure, the contrasts can, in 
many cases, be expected to be reasonably constant from year to year. 
If this is the case the interactions of 7, with years can be regarded 
as estimates of error. 

As a first step we will consider the estimates of error provided by 
the data of Table 3. The analysis of variance of these data takes the 
standard form for Latin squares but with a component of T; X years 
separated in addition to plots, years, and phases. The partition of 
degrees of freedom is therefore 


Source d.f. 
Years 4 
Phases 4 
T; X Years 4° 
Plots (Plot error) 4 
Remainder (Plot X year error) 8 
Total 24 


The plot error cannot be determined directly from plot totals since 
plot totals are not orthogonal with 7, X years. It is, however, possible 
to express the difference between the totals for any two plots as the 
sum of two parts, one of which is a component of 7, X years and the 
other a component of 7... X years. 

Quantities D; and P; — D,; are required, where P; is the total yield 
on the 7th plot (¢ = 1, 2 --- 5). Each D; is defined as the difference 
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between the yield for phase 0 and the mean yield for phases 1-4 in the 
year in which plot 7 receives treatment M,. Thus, 


D; = You — + Yow + Yaw + Yaw) 
where Yow , Yiw *** are the yields for phases 0, 1, 2 --- in year w and w 
‘takes the values 1, 5, 9, 13, and 17 for i = 1, 2, 3, 4, and 5 respectively. 

Differences between the D; are, by definition, components of 7, X 
years. It can be verified that differences between the P; — D; are 
components of 7... X years. Hence the plots sum of squares is a 
function of the P; — D; . The sum of squares of the coefficients of 
the y’s in the expression for any difference between the P; — D, is 
2 X 15/4. Consequently, the sum of squares for plots is simply 

4 2 

15 2 
where R; = P; — D; — P + Dand P and D are means of the P; and 
D; respectively, over the five plots. 

If there are no real interactions between years and the contrasts 
T.-4 , the remainder mean square provides an unbiased estimate of the 
average error variance of all possible differences between the mean 
yields (over the five years) for the five phases. Since, however, the 
phases necessarily follow each other in a prescribed order, the error 
variances of individual phase-differences are not the same. Thus, 
differences between phases one or four apart, e.g. phase 0 v. phase 1, 
phase 4 v. phase 0, etc. have one error variance; differences between 
phases two or three apart, e.g. phase 0 v. phase 2, phase 4 v. phase 1, 
have another. These two variances are, however, unlikely to be very 
different and will not be separated. 

If the errors of the yields of the plots are regarded as the sum of a 
part which is constant over the years with variance o? and a part which 
varies from year to year with variance o: , the expectation of the plots 
sum of squares is 1503 + 4c% and the expectation of the remainder 
(plot X year error) sum of squares is 8c? . 

The remaining 7 Latin square tables are treated in exactly the same 
way giving totals of 32 degrees of freedom for plot error and 64 degrees 
of freedom for plot X year error. A further 18 degrees of freedom for 
plot X year error are given by the interaction T,_, X series. The 
complete analysis of variance is as shown in Table 4. 

The estimated error variance of the average value, over the twenty 
years, of the difference between two phases of a single cycle is M,/10. 

The variances of contrasts between the two cycles involve both 
plot error and plot X year error. It is convenient to express these 
variances in terms of o} and of . Thus, for example, the variance of 
the average difference between the cycles, taken over all phases and 
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TABLE 4 
ANALYSIS OF VARIANCE 
Source d.f. M.S. | Expectation 

Years (Y) 19 
Treatment cycles (7'o) 1 
Treatment phases 8 
19 
38 
Plot error 32 
Plot X year error 82 M, o 

Total 199 


years, is (50, + o2)/50, since the difference is based entirely on plot 
totals. An estimate of the variance is, therefore, ($M, — M,)/50. 
It should be noted that the validity of this estimate is not affected by 
the somewhat restrictive conditions imposed on the errors in defining 
o, and o; . In fact the randomization of the experiment ensures that 
the estimated variance is unbiased for all systems of correlations 
between the yield values. 

Further examples of the calculation of error variances in this type 
of experiment will be given in a later section. 

The above procedure can also be used for the estimation of the 
errors of other experiments of the same type over a complete crop- 
treatment cycle, provided that the interactions of years with differences 
between the residual phases (phases 1 to s — 1 where s is the number 
of phases per cycle) are small. The general expression for the plots 
sum of squares is 

} 5 
where D, is the difference between the yield of phase 0 and the mean 
yield of phases 1 to s — 1 for the treatment cycle associated with plot 7 
in the year in which plot 7 is in phase 0 and >>,,oups denotes summation 
over all groups of s plots in the same series and associated with the same 
treatment cycle. The partition and expectations of the error degrees 
of freedom are as follows: 


Source df. Expectation 


Plot error ct(s — 1) s(s — 2)o3/(s — 1) + 0% 
Plot X yearerror | ci(s — 1)(s — 3) + (c — 1X(s — 2) o 
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where ¢ is the number of treatment cycles and c is the number of crops 
such that c and s have no common factor. 

If years X residual phases are not small, the analysis is more com- 
plicated. A general method is outlined in the appendix to this paper. 
The method can also be used for experiments on cycles of s different 
treatments applied in successive years, provided that a suitable partition 
of degrees of freedom is available. 

When the treatments are factorial or involve several rates of fertilizer, 
additional estimates of error can sometimes be obtained from compo- 
nents of treatment cycles X years by an analysis along the lines de- 
scribed in the previous section. Further degrees of freedom for error 
may also be available when continuously treated plots are included in 
the experiment. The simplest method for combining the different 
estimates of error is to add corresponding sums of squares in the separate 
parts of the analysis. Strictly, greater weight should be given to plot 
errors determined entirely from plot totals than to plot errors determined 
from adjusted plot totals. Usually, however, trivial amounts of informa- 
tion are lost if this point is ignored. 


The Rothamsted four-course rotation experiment 


In the remaining sections of this paper the analysis of the results of 
an experiment carried out at Rothamsted from 1930 to 1954 will be 
considered in detail. This experiment was of the same type as the 
example of the previous section, but presents additional problems of 
analysis due to the arrangement of the plots in incomplete blocks. 

Full details of the layout and treatments were given in the Rotham- 
sted Report for 1930 (pp. 39-41 and pp. 125-127). 

The crops of the rotation were potatoes, barley, ryegrass, and wheat, 
in that order. Each crop was grown in every year in four series which 
were kept separate in the field. 

The treatments consisted of the following manurial cycles: 

D: Farmyard manure supplying 50 cwt of organic matter per acre 

applied together with supplementary fertilizers once in five years. 

C: Straw compost also supplying 50 cwt of organic matter per acre 
applied with supplementary fertilizers once in five years. The 
compost was prepared with a proprietary powder consisting of 
calcium carbonate, cyanamide, and ground rock phosphate. 

S: Raw straw applied directly to the land once in five years in an 
amount which, if composted, would have supplied 50 cwt of 
organic matter per acre. Supplementary fertilizers were applied 
in three doses in the year of application of the straw. 

P: Superphosphate applied once in five years. Supplementary 
nitrogen and potash were applied annually. 
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G@: As in P but with rock phosphate in place of superphosphate. , 

The amounts of supplementary fertilizers, which varied considerably 
from year to year, were calculated to make up the total rates of nitroger, 
phosphate, and potash applied over a five year period to 1.8 ewt N 
per acre, 1.2 ewt P.O, per acre and 3.0 ewt K,O per acre for each manurial 
cycle. 

The treatments differ, therefore, in the forms, times of applying, 
and methods of application of the three main nutrients and in the 
amounts and forms of organic matter. All of these factors have to be 
taken into account in the final assessment of the results. 

Each series was divided into five randomized blocks of five plots 
each, one plot of each block for each manurial cycle. In any one year 
five plots per series, each in a different block and associated with a 
different manurial cycle, received the treatments of the first phases of 
their respective manurial cycles. Over five years each one of these 
plots passed through all the phases of the manurial cycle allocated to it; 
in the next five years the cycle was repeated but this time starting 
with a different crop. Ignoring four preliminary years, therefore, all 
possible combinations of crops and phases of each manurial cycle were 
represented in each year; and over twenty years all combinations of 
crops and phases of one of the manurial cycles were represented on 
each plot. 

The actual layout for one series of the experiment (before randomi- 
zation) is shown in Table 5. 

It is convenient, for the purposes of analysis, to reorientate Table 5 
and the similar tables for the other series so that block numbers are in 
the body of the table and the manurial cycles define the rows. The 
revised tables for potatoes are shown in Table 6. 


Yields of potatoes 


As the potatoes produced the most interc sting .esults, the yields of 
this crop are used here to demonstrate the analysis. These yields are 
set out for the years 1934-53 in Table 7. The experiment started in 
1930 but the yields for 1930-3 were not included in the analysis because 
from the nature of the design some plots did not have their first treatment 
until 1934. 

The yields of potatoes were poor, particularly in the early years of 
the experiment. Because of the low yields, various changes were 
introduced, the most important of which was the application of addi- 
tional nitrogen to one half of each plot in 1942 and succeeding years. 
Different randomizations of the half-plots were used in each year. 
The means of the two sub-plot yields are given here. The sub-plot 
yields have only been used in the evaluation of trends. 
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TABLE 5 
Layout For Series 4 
Crop Years of first phases* 
Potatoes 1934 1938 1942 1946 1950 
Block Barley 1939 1943 1947 1951 1935 
Ryegrass 1944 1948 1952 1936 1940 
Wheat 1949 1953 1937 1941 1945 
16 Ss D G P C 
17 G C D Ss 
18 Manurial cycle} P G C S D 
19 D Ss Cc G 
20 C Ss D G r 
*excluding preliminary years 1930-33. 
TABLE 6 


ALLOCATION OF MANURIAL CycLEs TO BLocks CARRYING POTATOES 


Series 1 Series 2 
Years of 
first phases | 1936 1940 1944 1948 1952 1935 1939 1943 1947 1951 
Block Numbers Block Numbers 
D 4 3 I 5 1 9 10 6 8 7 
C 5 1 4 3 2 6 9 10 7 8 
Manurial S 3 2 1 4 5 10 7 8 9 6 
cycles P 1 4 5 2 3 7 8 9 6 10 
G 2 5 3 1 4 8 6 7 10 9 
Series 3 Series 4 
Years of 
first phases | 1937 1941 1945 1949 1953 | 1934 1938 1942 1946 1950 
Block Numbers Block Numbers 
D)\ 12 15 14 ll 13 19 16 20 17 18 
Ci il 14 13 15 12 20 17 18 19 16 
Manurial S 15 13 ll 12 14 16 20 19 18 iz 
cycles P| 14 12 15 13 11 18 19 17 16 20 
G| 13 il 12 14 15 17 18 16 20 19 
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The mean yields of potatoes over the twenty year period are shown 
graphically in Figure 1. The methods used to estimate the errors of 
comparisons between these mean yields will now be described. 


ORGANIC MANURES PHOSPHATIC FERTILIZERS. 
FARMYARD MANURE. (0). SUPERPHOSPHATE. (Pp). 
composT. (c). GAFSA ROCK PHOSPHATE. (G). 
RAW STRAW. (5S). 


POTATOES. (TONS PER ACRE) 
o 


> 
T 


1 2 3 4 
PHASES. PHASES. 


1 2 3 4 
FIG. 1 
MEAN YIELps oF PoraToEs (1934-53) 


Analysis of variance: partition of degrees of freedom 


There are altogether 24 degrees of freedom for contrasts between 
the 25 mean yields shown in Figure 1. These degrees of freedom can 
be partitioned as follows: 

T. (4 d.f.): contrasts between the mean values over all phases for 

the five manurial cycles. 

T, (5 d.f.): contrasts between the first phase (phase 0) and the 
mean values for the remaining phases (phases 1-4) 
for each manurial cycle. 

(15 d.f.): contrasts between phases 1-4. 

This partition is the same as that used in the experiment on two 
treatment cycles. In this case, however, the analysis is more complicated 
because 

(a) the interactions years X T._, include blocks and blocks X years 

components as well as plots and plots X years. 

(b) the separation of 7,_, from blocks X years is a complicated 

procedure, owing to the different frequencies with which pairs 
of treatment-phase combinations occur in the same block. 
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Thus D1 (i.e. cycle D in phase 1) and C2 occur in the same 
block in 0, 3, 0, and 0 years in the four series whilst for D1 and 
C3 the number of occurrences are 5, 0, 2, and 0. 

In order to avoid undue complication only 4 of the 15 degrees of 
freedom for T,_, have been separated. These 4 degrees of freedom are 
the largest contrasts between phases 1-4, chosen after inspection of 
Figure 1 and defined as follows: 

(1) 3y, + yz — ys — 3ys , for manurial cycles D, C, and P, and 

(2) ye — ys , for manurial cycle S. 

These contrasts will be denoted by T,. Since the mean yields for cycle 
G are almost the same in all phases no contrast for this cycle is included 
in T,. 

The remaining 11 degrees of freedom (7'; and 7.) together with the 
285 degrees of freedom for years X 7',_, constitute 16 degrees of freedom 
for blocks, 64 degrees of freedom for blocks X years and 216 degrees 
of freedom for plots and plots X years. The 100 plots of the experiment 
can be divided into groups of five plots in the same series and associated 
with the same manurial cycle. The comparisons between groups of 
plots are accounted for by 3 of the degrees of freedom for years (i.e. 
series), 4 degrees of freedom for 7’, and 12 of the degrees of freedom for 
To X Y (i.e. To X series). Of the remaining 80 degrees of freedom, 16 
are confounded with block differences. There are, therefore, 64 degrees 
of freedom for the estimation of plot error and hence 152 degrees of 
freedom for plot X year error. The full partition of degrees of freedom 
is, therefore, 


Source df. 

Years (Y) (including Series) 19 
Blocks in the series (B) 16 
Y 64 
4 

Treatments 7; 5 
4 

T XY 76 
95 
Plot error 64 
Plot X year error 152 
Total 499 


The analysis has been carried out in two parts in order to take full 
advantage of the considerable degree of balance existing in the data. 
In the first part 7, is ignored. This part of the analysis involves a 
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straightforward application of least squares. In the second part 7, 
is eliminated by means of an analysis of covariance. 


Analysis of variance (ignoring T.) 


The analysis of variance, ignoring 7’, , is set out in Table 10. Blocks 
X years is not orthogonal with 7, , 7; X Y, or plots and T, X Y is 
not orthogonal with plots. The difficulty can be overcome by replacing 
the plot totals and the block totals in each year by orthogonal functions. 

The orthogonal functions and sums of squares can be obtained by a 
least squares procedure. Suppose that plots 7 and k, which are both 
in block J, and plot j, which is in a different block (7, j,k = 1, 2 --+ 25; 

= 1,2 --- 5 fora single series) are subject to manurial cycles as follows: 

Plot 7: cycle m in phase 0 in year n, 

Plot j: cycle m in phase 0 in year n’, 

Plot k: = m’ in phase 0 in year n’, (m, m’ = 1,2--- 5;n,n’ = 1, 

- 20). 

The set of Ih adopted is 

t; : the mean level of yield, over all five phases, for cycle m in year n. 

p; : the effect of plot 7. This effect is supposed constant from year 

to year. 
d, : the difference between the level of yield in phase 0 and the mean 
level of yield over all phases for cycle m in year n. 

b, : the effect of block J, i.e. the block containing plot 7 in year n. 

The yield of the 7th plot in year n is thus represented by ¢, + p; + 
d; + 6; and a residual term. The yield of the same plot in year n’ is 
represented by t; + p; — id; + b, and a residual term. 

The normal equations for a single series are then: 


S.(t,) + 5p; d; — iS, (d;) + S,(b,) 
+d; + 2b; = D; (1) 
S,(t,) + S,(p,) + d; — iS, (d;) + 5b; B; 


for all 7, where S, denotes summation over a group of plots subject to 
the same manurial cycle, S, denotes summation over a group of plots 
in the same block, and S, denotes summation over a group of plots 
with the same phase-number in any year. P; is the total yield of 
potatoes on the ith plot. D; is the difference, in the year in which 
the manurial cycle on the ith plot is in phase 0, between the yield 
on the ith plot and the mean yield for phases 1 to 4 of the same cycle. 
B, is the total yield, in the same year, of the block containing the 7th plot. 

Contrasts between the B; — D,; for plots with the same phase- 
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number in any year are orthogonal with 7’, X Y since the d; are elimi- 
nated. Ignoring the p; , 

b; S,(b,)/5 = 4[(B; D3) S,(B; D,)/5)/15. 


‘The total sum of squares for blocks and blocks X years, ignoring plots 
but eliminating 7, X Y, is therefore 


The sum of squares for blocks is : 
(SB. — D,) SB. — 3) 


since the block effects are given by comparisons between the S,(b,). 

Contrasts between S,(P; — D;) do not involve terms in b; and are, 
therefore, orthogonal with B X< Y as well as with 7, X Y. Part of the 
sum of squares for plots, with four degrees of freedom from each series, 
is therefore given by 


4[dev’ S(P; — D,)]/75. (4) 


Other contrasts between the P; — D; are not orthogonal with B X Y. 
Since comparisons between the S,(p;) and comparisons between the 
S,(p;) involve treatment and block effects, only the deviations p/ say, 
given by 


need be considered. If P! , d/ , D! , bi and B? are defined similarly to 
then 
Spi + = Pt 
+ $di + $b; = Di (5) 
$di+ 5b! = Bi. 
Hence 


= — D) + ~ DDI 


and the remaining part of the sum of squares for plots, with 12 degrees 
of freedom from each series, is therefore 

— Di) + — (6) 

The sum of squares for B X Y (eliminating plots and 7, X Y) is 

also required so that allowance can be made in the estimation of standard 
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errors for the blocks X years components included in certain comparisons 
between the mean values of Figure 1. Owing to the symmetry of 
equations (1) and (5), this sum of squares is given by expressions (4) 
and (6) with P; and B; , P/ and B! , S, and S, , etc. interchanged. The 
plots sum of squares (ignoring B xX Y, eliminating JT, X Y), which 
provides a useful arithmetical check, described below, is given by a 
similar interchange of symbols in expressions (2) and (3). 

The arithmetical operations on the P; — D; and B; — D; are best 
carried out by setting out these quantities in two-way tables classified 
by manurial cycles and blocks. The third classification, the year in 
which phase 0 occurs, is then given by the appropriate Latin square of 
Table 6. Tables of (1) 4(B; — D,), (2) 3P; + B, — 4D, , (3) 4(P; — Di), 
and (4) 3B; + P; — 4D; for series 4 are set out in Table 8. The values 
are derived from Table 7. Thus the total yield of the compost plot in 
block 18 is 


P; = 2.8 + 2.7 + 7.5 + 6.9 + 7.2 = 27.1 tons per acre. 
The block total for the year in which the plot received compost is 
B, = 734+ 7. + 8.3 + 7.5 + 6.6 = 37.2. 
D; , the difference in that year, between phase 0 and the mean for 
phases 1-4 is given by 
4D; = 4X75 — 6.6 — 4.4 — 5.3 — 5.2 = 8.5. 

Hence 4(B; D;) is 140.3, B; 4D; is 110.0, A(P; D;) is 
99.9, and 3B; + P,; — 4D, is 130.2. Apart from rounding off errors, 
arising because the original calculations were carried out to two places 
of decimals, these are the values shown in Table 8. An ordinary Latin 
square analysis for each table, using additional divisors of 16 X 15/4 = 
60 for (1) and (3) and 9 X 10/3 = 30 for (2) and (4) yields the sums of 
squares set out in Table 9. 

These sums of squares can be conveniently denoted as follows: 


\ 
Sums of squares 
Classification 
of plots d.f. (1) (2) (3) (4) 
4(B; — D;)8P; + B; — 4D,\4(P; — 3B; + — 4D; 
Manurial cycles 4 Sa Sa Se Su 
Years 4 Sa Sy2 Sys Sy 
Blocks 4 Sn Sw Sis Su 
Remainder 12 Sn Sr Srs Su 


The sums of squares for plots, blocks X years, and blocks for series 
4 are 
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TABLE 8 
LaTtIN SQuARE TABLES For ANALYSIS OF VARIANCE OF SErRtES 4, IGNORING 7’ 
Manurial Year in which manurial cycle allocated to plot 7 is in phase 0 
Cycle 
1934 1938 1942 1946 1950 
4(B; — Di) 
D 67.5 (19)* 45.5 (16) 139.9 (20) 148.4 (17) 142.8 (18) 
ay 67.9 (20) 50.8 (17) 140.3 (18) 145.0 (19) 144.8 (16) 
Ss 73.4 (16) | 50.5 (20) 131.6 (19) 145.7 (18) | 146.3 (17) 
P 68.2 (18) 45.0 (19) 131.3 (17) 155.2 (16) 153.3 (20) 
G 76.5(17) | 46.9 (18) 95.1 (16) 145.1(20) | 150.0 (19) 
3P; + B; = 4D; 
D 99.2 (19) 85.4 (16) 110.8 (20) 119.0 (17) 119.9 (18) 
C 100.3 (20) 94.4 (17) 110.1 (18) 113.3 (19) 111.2 (16) 
Ss 91.9(16) | 93.4 (20) 123.1 (19) 127.1 (18) | 132.2 (17) 
P 101.0 (18) 101.3 (19) 118.5 (17) 124.9 (16) 140.4 (20) 
G 89.1 (17) | 85.8 (18) 100.5 (16) 111.8 (20) | 102.2 (19) 
4(P; — Di) 
D | 109.7(19) | 98.7 (16) 101.1 (20) 109.2 (17) | 112.2 (18) 
C 111.1 (20) 109.0 (17) 100.1 (18) 102.7 (19) 100.0 (16) 
S 98.0(16) | 107.7 (20) 120.3 (19) 120.8 (18) 127 .5 (17) 
P| 111.9(18) | 120.1 (19) 114.2 (17) 114.8 (16) | 136.0 (20) 
G 93.3 (17) 98.8 (18) 102.3 (16) 100.8 (20) 86.3 (19) 
3B; + P; = 4D; 
D 78.0 (19) 58.8 (16) 130.2 (20) 138.6 (17) 135.2 (18) 
C 78.7 (20) 65.3 (17) 130.2 (18) 134.4 (19) 133.6 (16) 
S 79.5 (16) 64.8 (20) 128.8 (19) 139.5 (18) 141.6 (17) 
a 79.2 (18) 63.7 (19) 127 .0 (17) 145.1 (16) 149.0 (20) 
G 80.7 (17) | 59.8 (18) 96.9 (16) 134.0 (20) | 134.1 (19) 
*(Block numbers are given in brackets.) 
Blocks (eliminating T, Y): Se + 
Blocks X years (ignoring plots, eliminating 
Plots (eliminating B X Y and T, X Y): S,; + S,. = 13.61 
Blocks X years (eliminating plots and 
Plots (ignoring B X Y, eliminating 7’, X Y): S,, + S,, = 18.81. 


The other series are treated similarly and the total sums of squares 
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TABLE 9 
ANALYsIS OF LATIN SQUARES OF TABLE 8 


Sums of squares 


Classification (1) (2) (3) (4) 
of plots df. |4(B; — D;)|38P; + B; — 4D.\4(P; — 3B; + Pi — 4D; 
Manurial cycles 4 3.35 37.15 27.43 13.06 
Years 4 713.18 118.62 2.80 829.00 
Blocks 4 3.81 7.61 3.81 7.61 
Remainder 12 20.13 12.81 16.01 16.92 


over all series calculated. The complete analysis over all series is set 
out in Table 10. 


Analysis of variance: elimination of T, 


The most convenient method for dealing with the four contrasts 

T. is to carry out an analysis of covariance on four variates x, , X2 , X3 , 
and 2, , one for each contrast. The variates take the following values: 
x, :3 in phase 1, 1 in phase 2, —1 in phase 3, and —3 in phase 4 for 
manurial cycle D in each year. 

2 and x, are similar to x, but apply to manurial cycles C and P 


respectively. 


TABLE 10 


ANALYSIS OF VARIANCE OF YIELDS OF PoTaTOES 
(1934-53) 1g¢NorING Contrasts 


Source d.f. M.S. 

Years (Y) 19 133.29 
Treatments 7'o 4 35.85 
Ti 5 28.50 

Y 76 1.01 
Blocks (B) (elim. 7; X Y) 16 5.74 
B X Y (elim. 7; X Y, ign. plots, T2) 64 0.706 
Plots (elim. 7; X Y, ign. T2) 64 0.643 
B X Y (elim. 7: X Y, plots, ign. 72) 64 0.547 
Plots (elim. 7: X Y, ign. B X Y, T2) 64 0.802 
Plots X years (ign. 7’2) 156 0.342 

Total 499 
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Z3 : 1 in phase 2 and —1 in phase 4 for manurial cycle S in each year. 
Sums of squares and products of the x’s and sums of products for the 
x’s and yield values, which will be denoted by y, are formed in an 
analogous manner to that described above for yield values. 

Since this application of the analysis of covariance has been described 
in this journal by Quenouille [1948], there is no need to give the analysis 
in detail here. The main advantages in the present analysis are that 
complicated algebraic equations which are of no general interest are 
avoided and that, if required, variations in the definition of 7, for the 
different crops can be easily introduced. 

The full analysis of variance is given in Table 11. The plots X 
years mean square is substantially reduced by the separation of T, . 
Little further improvement in this estimate of error is, however, ob- 
tained by eliminating further treatments degrees of freedom. 


TABLE 11 
ComPLETE ANALYSIS OF VARIANCE OF YIELDS OF PoTaToEs (1934-1953) 

Source d.f. MS. 

Years (Y) 19 133.29 
Treatments 7’ 4 35.85 
5 28.50 

T2 4 4.24 

Y 76 1.01 
95 1.38 
Blocks (B) (elim. 7; X Y) 16 5.74 
B X Y (elim. T; X Y, 72, ign. plots) 64 0.687 
Plots (elim. 7, X Y, B X Y, T2) 64 0.632 
B X ¥Y (elim. 7; X Y, plots, 72) 64 0.517 
Plots (elim. 7; X Y, 72, ign. B X Y) 64 0.802 
Plots X years 152 0.252 

Total 499 


Standard errors 


For the purpose of determining standard errors a blocks X years 
component of variance which will be denoted by o;, per single yield 
value is required in addition to o° and o? . 

Estimates of 7 , oj, , and o? can be obtained by equating the mean 
squares for plots X years, plots (eliminating 7, X Y, T, and B X Y), 
and B X Y (eliminating 7, X Y, plots, and 7’,) to their expectations. 
The expectation of the plots X years mean square is simply o% . The 
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expectations of the plots and B xX Y mean squares are obtained by 
replacing the y’ terms and the squares and products of the xy terms used 
in the analysis described above by the expectations of these terms. 

The expectations of the o? and o;, components of the y’ terms can 
be obtained by putting var (p) = o? , var (b) = o;, (so that var [p; — 
S.(p:)/5] = #03 , var pi; = 40% and so on) in equations (1) and (5). 
These expectations are given in the following table: 


TABLE 12 
EXPECTATIONS OF AND COMPONENTS 
Expectation of M.S. 
Source df. 5.8. (error components 
only) 

Plots (eliminating 7, X Y, B X Y, 
ignoring T'2) 16 Sys 15 65 + 0% 
48 Sr 10 + 0% 
Total 64 | Sys + Sra Sop 

B X Y (eliminating 7; X Y plots, 
ignoring T2) 16 Sa 1S aby + 
48 Sra + 
Total +o: 


The expectation of the square of the z,y term for any single degree of 
freedom is obtained by multiplying the expectation of the y’ term by 
the xj term. Similarly the expressions for products of x,y and z.y 
terms are given by multiplying the expectation of the y’ by 2,2, terms. 

The required expectations of the plots and B X Y mean squares of 


Table 11 turn out to be: 


Source 


d.f.| Expectation of M.S. 


Plots (eliminating 7; X Y, T: and B X Y) 
B X Y (eliminating 7; X Y, T2 and plots) 


64 3.43 +03 
64 3.38 oby +0: 


Consequently estimates of o° , o: , and o;, are given by 
s. = 0.252, s, = (0.632 — 0.252)/3.43 = 0.111, 
si, = (0.517 — 0.252)/3.38 = 0.078. 
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The above estimates of o; , o;, , and o; are not fully efficient. More 
efficient estimates of o; and o;, can be obtained by using weighted 
sums of squares for plots and B X Y with weights depending on the 
expectations of individual degrees of freedom. Comparisons between 
plots (or B X Y) degrees of freedom having different expectations 
provide additional estimates of o; . The gain in efficiency is, however, 
trivial as 0} and o}, are small. Even if o% and o}, were very large relative 
to o% the reduction in the variances of o} and o}, would amount to less 
than 1%. 

The contributions of 7 , «3, , and o; to the variance of any comparison 
between the means shown in Fig. 1 can be determined by assigning a 
variance of (0% + o;, + o2)/20 to a single mean, a covariance of «2/20 
to two means in different phases of a treatment cycle, and a covariance 
of o;,/80 to two means in different phases of different treatment cycles. 
The second covariance is an average value; the actual covariance varies 
for different pairs of means owing to the lack of balance in the arrange- 
ment of blocks. Since o;, is small the average covariance is sufficient 
for practical purposes. The standard error of any comparison >, m@ is 


Vas [Aso> + As)oby + ioe] (7) 


where g are the means for individual treatment phase combinations, 
m=0, = = Mi+ Mi, + 
+ Mj,andM,,M-,--- M.,M,, --- are the sums of the m 
for manurial cycles D, ( --- and phases 0, 1, --- , 4. 
Several examples of the use of equation (7) will now be considered. 
(1) The difference between phases 0 and 1 for manurial cycle G is 
—0.18 tons per acre. Here \,; = A, = 2and = 0 so that the 
standard error of the difference is /1/20(2s;, + 2s.) = 
V2 X 0.128 or +0.182 tons per acre. The difference is not, 
therefore, significant. Since plot differences do not enter into 
this type of treatment difference, the expression for the standard 
error does not include a term in s; . 
(2) The interaction (S — D) X (phase 1 — phase 2) is —0.48 tons 
per acre. The standard error is V/1/20(5s;, + 482) = +0.264, as 
= = Az = 0. 
(3) The standard error of a difference between the mean values for 
two manurial cycles is V1/20(2s; + 2s2) = V2 X 0.089, as 
A. = = 0, A; = 2. 
Standard errors for comparisons of type (1) and (3) are shown in Table 
12. For other comparisons it is convenient to use the expression 


s.e.of mg = V[0.0175\, — 0.0010, + 0.0046d;]. 
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TABLE 13 
MEAN YIELDs oF PoraToEs 1934-53. 
(Tons per acre per year) 


Manurial cycle 


Phase D Cc Ss G 
0 6.94 | 6.53 | 7.35 | 7.26 | 4.67 
1 5.93 | 5.49 | 5.62 | 6.19 | 4.85 
2 5.60 | 5.04 | 5.77 | 6.20 | 4.86 
3 5.36 | 4.90 | 5.52 | 6.01 | 4.81 
4 5.13 | 4.62 | 5.38 | 5.94 | 4.65 
Mean (0.089) | 5.79 | 5.32 | 5.92 | 6.32 | 4.77 


s.e. of a single mean for comparing different phases of one manurial cycle = +0.126. For other 
comparisons use equation (8). 


Interpretation 


The main points of agricultural interest arising out of the above 
analysis are briefly discussed below. Reference will be made to Table 13 
which, for convenience, reproduces numerically the values shown 
graphically in Fig. 1. 

1. Since the site was calcareous, responses to rock phosphate (manurial 
cycle G) over a short period were expected to be small. The experiment 
does not provide for the direct estimation of responses to rock phosphate, 
but as there is little difference between the mean yields in the different 
phases (the largest difference is 0.21 + 0.18 tons per acre) it is reasonable 
to suppose that the responses were in fact small. Differences between 
manurial cycles P and G can therefore be regarded as estimates of the 
responses to superphosphate. The direct response to superphosphate 
is given by the difference in phase 0, the first year residual response by 
the difference in phase 1, and so on. 

2. The direct response to superphosphate was large. Although they 
received much smaller quantities of supplementary fertilizers in phase 0, 
the plots receiving manurial cycle P gave as good yields as the plots 
receiving manurial cycles D or S. This result requires agricultural 
explanation. 

3. The residual responses to superphosphate were also large, being 
on the average just over one-half of the direct responses. If the responses 
to phosphate are proportional to (1 — 10~°'**), where z is the amount 
of P.O; applied, as suggested by Crowther and Yates [1941], the residual 
responses are almost the same as the direct responses which would 
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have been obtained with about 0.34 ewt instead of 1.2 ewt P.O, per acre. 
4, The mean yields for manurial cycle P showed a small decline at the 
rate of 0.094 + 0.057 tons per acre per year during the last four phases. 
Since the fourth year residual response to superphosphate is still large, 
it is of interest to determine whether the residues in the fifth, sixth, 
-++ years produce any trend with repeated applications of the treatment. 
In fact the responses of potatoes to superphosphate improved during 
the course of the experiment. This improvement cannot, however, be 
attributed with any certainty to accumulating phosphate since it is 
associated with an overall improvement in the level of yields of potatoes 
on all plots. The point will be discussed in more detail in the section 
on trends. 

5. The hypothesis which is sometimes suggested, that the residues of 
phosphate decrease by a proportional amount in each year, is not 
supported by the present data, since an exponential decay curve with 
the mean yield for manurial cycle G as limiting value gives a very 
poor fit. 

6. The compost cycle gave much poorer yields than either of the other 
two organic manurial cycles. Thus, the average differences in yield 
between cycles D and C were 0.41 + 0.212 tons per acre in the year of 
applying the organic manure and 0.49 + 0.134 tons per acre one to four 
years after application. Since on the average about one-half of the 
phosphate given in manurial cycle C was in the form of rock phosphate, 
these results are consistent with the observation that rock phosphate 
produces little or no response and that the composting has not in fact 
converted the rock phosphate into an available form. 

7. The responses to manurial cycles D and S in phases 1 to 4 were 
much smaller than the residual responses to superphosphate. This 
may be partly due to deficiencies in nitrogen and potash since the D 
and S plots were not given any fertilizers in phases 1 to 4. During 
these four phases yields for both cycles D and C decline (at rates of 
0.26 + 0.057 and 0.28 + 0.057 tons per acre per year) more rapidly 
than yields for cycle P. 

8. The difference between the yields for manurial cycle S and manurial 
cycle D in phase 2 is greater than the difference in phase 1 by 0.48 + 
0.264 tons per acre. Although it only achieves a low level of statistical 
significance here, this effect is repeated for the other crops and must be 
regarded as real. 


Long-term trends 


Trends in the yield values produced by the repeated application of 
the manurial cycles are also of considerable interest in the experiment. 
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Thus one of the aims of the experiment is to determine whether the 
plots receiving organic manures show any improvement in fertility 
relative to plots receiving fertilizers only. 
The estimates of the changes in yield adopted for the four-course 
rotation experiment are one-twelfth of the differences between the mean 
yields (adjusted, as described below, for block effects) for the last eight 
years and the corresponding mean yields for the first eight years. 
These estimates are less precise than the ordinary linear regressions 
with coefficients proportional to —19, —17, --- + 17, +19, but have 
the advantage that they are not influenced to the same extent by large 
errors, such as may well occur in the first few years of a complex rotation 
experiment, or by special seasonal conditions in the very early or late 
years. Assuming that the differences, y, in year t, between the yield 
values for different treatments follow an exponential curve of the type 
y, = a + Bp' and ignoring plot and block effects, the percentage effi- 
ciencies of the two estimates of change relative to the most accurate 
possible estimate are as follows: 


Estimate 
(1) (2) 
20 8 

Half-life ds] Dye Dy: Linear 
Years us Regression 

5 (i.e. p> = 3) 73.8 89.3 

10 83.0 96.8 

15 85.0 98.6 

20 85.7 99.2 

Linear change 86.7 100.0 


These efficiencies are obtained by comparing the ratios (expectation)’/ 
variance for the three estimates. The loss of precision in the estimate 
used here, relative to ordinary linear regression is, therefore, approxi- 
mately the same as that produced by rejecting the results of the first 
year. 

Before the changes are calculated, the yield values require adjustment 
in order to eliminate block differences. The simplest method for elimi- 
nating block differences is to subtract from each yield the appropriate 
deviation, 6, say, of the block mean (over all years) from the series 
mean. This method is satisfactory provided that (a) B X Y can be 
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ignored and (b) the effects confounded with constant differences between 
blocks are negligible. Condition (a) is satisfied as oj, is small. The 
confounded effects referred to in (b) include certain components of 
T, X Y. Although these particular components are likely to be small, 
they will be eliminated here since the quantities required for this purpose 
have already been determined in the analysis of variance. The pro- 
cedure consists simply of replacing the 5 by 


6’, say = 4[S,(B; — D,) — 4S(B; — D,))/75. 


The 6’ are very little different from the 6. Further adjustment for 
confounded effects appears, therefore, to be necessary. 

The estimates of the rates of increase in the yields of potatoes 
(excluding half-plots receiving additional nitrogen) are set out in Table 
14, 

TABLE 14 


Tue Estimates oF THE Rates or INCREASE (Tons Per AcrE Per YEAR) IN 
THE YIELDS oF PoraTors (ExcLupING Haur-PLors Recervinc Extra NITROGEN) 


Manurial cycle 
Phase D Cc Ss P G 
0 0.219 | 0.355 | 0.204 | 0.213 | 0.151 
1 0.179 | 0.249 | 0.197 | 0.209 | 0.182 
2 0.225 | 0.205 | 0.223 | 0.203 | 0.177 
3 0.199 | 0.257 | 0.226 | 0.224 | 0.178 
4 0.174 | 0.229 | 0.203 | 0.234 | 0.221 
Mean (+0.011) | 0.199 | 0.259 | 0.211 | 0.217 | 0.182 


s.e. of a single regression for comparisons between different phases of one manurial cycle = +0.030- 
For other s.e.’s see equation (9). (All s.e.’s are based on experimental error components only.) 


The standard errors provided in Table 14 were determined from 
the estimates of o° , o2 , 03, , and a half-plot component of variance 
(required because the estimates of trend in this case are partly based 
on half-plot yield values). The method of obtaining expressions for 
the standard errors is straightforward in principle and need not be 
described here. As the exact expressions are rather long, an approximate 
expression giving sufficiently accurate estimates has been determined. 
The expression is 


10° var >> mr = 915A, — 6A. — 51Asz (9) 


where the r are rates of increase, >. m = 0, and the ) are as defined for 
equation (7). 
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The most important changes indicated by Table 14 are 

(1) the improvement in the responses of potatoes to superphosphate, 
relative to the responses to rock phosphate. This has a bearing on 
point 4 in the section on interpretation. 

(2) The relative improvement of the compost plots, particularly in 
the year of applying the compost. The other two organic manures do 
not show any improvement relative to superphosphate. 

These improvements cannot, however, be attributed to cumulative 
effects of the treatments at this stage of the analysis, since external 
factors such as weather conditions, pests and diseases, management 
factors, and variability of the experimental treatments may contribute 
to the differences between the rates of increase. When the effects of 
the external factors are independent from year to year, it is sufficient 
to include in the standard errors a treatments X years component. A 
suitable procedure is given by Patterson [1953]. 

It is, however, apparent that the effects of the external factors on the 
yields of potatoes in the four-course rotation experiment were not 
independent from year to year. Table 8 shows that there has been a 
considerable improvement on all plots, including those receiving rock 
phosphate, which must be attributed to improvements in the methods 
of growing potatoes. It can be expected that the responses to the 
treatments have also, to some extent, been affected by these changes. 
Inspection of the annual differences between cycles P and G shows that 
these differences were greatest in high yielding years. The point can 
be brought out by determining the regression of the annual differences 
on the mean yields over all manurial cycles, after first eliminating time 
trends. This regression is 0.138 tons per acre for an increase of one ton 
per acre in the mean yield. Since the mean yield increased at the rate 
of 0.214 tons per acre per year, the regression accounts for a large part, 
0.214 X 0.138 = 0.0295 tons per acre per year, of the total increase 
0.035 tons per acre per year, in the annual differences between the two 
manurial cycles. 

Similarly, the high rates of increase for manurial cycle C, particularly 
in phase 0, are to a large extent associated with changes in the com- 
position of the compost. 

Thus there is no evidence that the trends (1) and (2) mentioned 
above can be attributed to cumulative long-term effects of the manures. 
In fact, it appears that such trends could equally well have occurred if 
the manurial cycles had not been repeated on the same plots. 

In general an experiment including only repeating sequences of 
treatments cannot be expected to provide satisfactory estimates of 
trends in yield values if external factors influencing the effects of treat- 
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ments themselves show marked trends. The effects of the external 
factors can, of course, be eliminated, in part at least, by means of 
multiple regression of the type mentioned above, but the resulting 
estimates are likely to be inaccurate. 


APPENDIX 


GENERAL METHOD OF ANALYSIS FOR EXPERIMENTS ON RESIDUAL 
EFFECTS 


Consider an experiment involving a rotation of ¢c crops and ¢ treat- 
ment cycles each with s phases such that c and s have no common 
factor. The experiment is laid down as described in the third section 
of the paper in c randomized blocks of ¢s plots and the results of cs 
years are available. Estimates of o; and o% are required for each crop. 

A set of orthogonal contrasts T, ,v = 1, 2, --- , s — 1 is defined for 
each treatment cycle. These contrasts can be expressed by Dou IueYu , 
where u = 0, 1, 2, --- , s — 1 are the phase numbers and )., &, is 
taken, for convenience, to be 1. 

Estimates of error will be derived from the interactions T,_, X Y, 
T.-1+1 X Y +++ and T,_,; X Y which are supposed to be small relative 
to their errors. The total sum of squares for these interactions is the 
total sum of squares for plot error and plot X year error. Usually r 
must be at least 2; when r is 1 the estimates of o} and o% can be very 
inaccurate. 

Consider the yields of a single crop on a group of s plots in the same 
series and associated with one particular treatment cycle. The phases 
of the cycle in years c apart follow a Latin square pattern in which 
columns represent the s plots and rows represent years. 

Let P be an s X 1 matrix with elements P; — P, in the order of the 
columns of the Latin square, L, an s X 1 matrix with elements 


calculated for each year w in the order of the rows of the Latin square 
and A, an s X s matrix of the /,, arranged in the Latin square pattern 
according to the phase numbers u. Then R, ans X 1 matrix of quantities 
R; is given by 


s-r-l 


R=P-— > AL= > 


v=1 v=s-—r 


The sum of squares for plots with s — 1 degrees of freedom is then 
; p:R,; where the p; are elements of 
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= (a aca.) R = (u aca.) R. 


I is the s X s identity matrix and u is an s X s matrix with all elements 
equal to 1. 

It is convenient to use the first of the alternative expressions for R 
and p when 27 > s — 1 and the second when 27 < s — 1. 

If the definitions of 7’, are the same for all treatment cycles, the 
remaining groups of plots are treated similarly using the same inverse 
s X s matrix for each group. This gives a total of ct(s — 1) degrees of 
freedom for plot error. The plot X year sum of squares is then obtained 
by subtraction. 

The expectations of the error mean squares are: 


Source dfs Expectation 
Plot error ci(s — 1) sr — 1) +03 
Plot X yearerror | cf(s — 1)(r — 1) + ér(e — 1) o 


A similar procedure is available when the number of years is less 
than cs. The Latin square is now incomplete and A, is non-square. The 
estimates of error will, however, include components of 7',_, , T,--+1 -*° 
and 7',_, . If necessary these components can be eliminated by fitting 
constants for phases. 


SUMMARY 


This paper deals with the statistical methods used to analyse the 
results of a non-replicated experiment on the residual effects of various 
organic manures and two forms of phosphate on a four-course rotation 
of crops. The discussion is mainly concerned with the estimation of 
errors. Owing to the lack of replication, the errors have to be determined 
from selected components of the year-to-year variation in treatment 
effects. The estimation of the errors is complicated by non-orthogonality 
between plot totals, blocks X years, and certain treatment comparisons 
which cannot be ignored. A least squares procedure which overcomes 
this difficulty is described. The estimation and interpretation of 
trends in the yield values are also briefly considered. 


A FOUR-COURSE ROTATION EXPERIMENT 
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THE ANALYSIS OF A TWO PHASE EXPERIMENT 


R. N. Curnow 


Agricultural Research Council Unit of Statistics 


University of Aberdeen 
Aberdeen, Scotland 


Introduction 


In a recent paper G. A. McIntyre [McIntyre, 1955] discussed the 
design and analysis of two phase experiments. These are experiments 
in which the effects of the various treatments on the experimental units 
cannot be assessed directly and a further experiment is needed to 
estimate them. McIntyre gave examples to show when these experi- 
ments were needed and suggested various experimental designs that 
could be used for the two phases. 

Generally the analysis of these experiments is straightforward and 
follows the pattern of the first phase design. The second phase of the 
experiment is analysed only if the efficiency of this phase is of interest. 
However, McIntyre continued his paper with the discussion of a more 
complicated two phase experiment in which the best method of analysis 
is not so obvious. — 

McIntyre estimated treatment differences and tested their signifi- 
cance by an analysis based on the first phase design. This analysis does 
not completely break down the variation into its component parts. 
The more complete analysis proposed in this paper results in two sets 
of estimates of the treatment effects corresponding to the inter- and 
intra-block estimates that arise in the analysis of an incomplete block 
design. These two sets of estimates are of different accuracies and 
should therefore be combined together by a suitably weighted mean. 
MclIntyre’s method of analysis is equivalent to taking the unweighted 
mean. Although little is gained by the weighting in the example given 
by McIntyre it may be well worthwhile for the same design applied to 
different experimental material. The new analysis does involve a 
little more computing than MclIntyre’s but it is not at all difficult to 
carry out. It should therefore be used for the extra information it 
may provide. 
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Description of the Design 


The experiment was designed to compare the effects of four light 
treatments on the synthesis of mosaic virus in tobacco leaves. Eight 
tobacco plants were divided arbitrarily into two sets of four and the 
light treatments applied to each set in a Latin square arrangement with 
plants as columns and leaf positions on plants as rows. 

The effects of these light treatments were estimated in the second 
phase by injecting sap from each of the first phase leaves into half-leaves 
of certain assay plants. The number of lesions which appeared on these 
half-leaves was counted. In order to equalize variances within treat- 
ments, the square root of these counts was taken as a measure of the 
treatment effect. The design was such that the specimens of sap from 
the four leaves of the first plant of the first Latin square were the Latin 
letter treatments, and the specimens of sap from the first plant of the 
second Latin square were the Greek letter treatments in a 4 X 4 Graeco- 
Latin square formed by the half-leaves of four assay plants with, as in 
the first phase, plants as columns and leaf positions on plants as rows. 
Three further Graeco-Latin squares were used to assay the sap from 
the other six first phase plants. 


MclIntyre’s Analysis 


McIntyre presented his analysis of the experiment in Table 2 of his 
paper. He estimated treatment effects and tested their significance and 
that of the test plant leaf position effects by an analysis of the thirty-two 
totals derived from the four assay plant half-leaves corresponding to 
each of the first phase leaves. To estimate certain variance components 
and test the significance of differences between assay plants and between 
leaf positions on assay plants, he also analysed the second phase. To 
do this he extended Yates’s method of “sums and differences” for a 
single Graeco-Latin square [Yates, 1937] to a set of four such squares. 

In a query to Biometrics, H. C. Hamaker [McIntyre, 1956] pointed 
out that McIntyre had not used all the available information in estimat- 
ing the second phase whole leaf error, of (McIntyre’s notation will be 
used throughout) and that in the analysis there were more mean squares 
than variance components. Hamaker questioned whether this should 
ever occur in a correct analysis and asked how, if it does occur, the 
variance components should be estimated. In his reply McIntyre 
agreed that further information on of could be obtained from the 
experiment and suggested a recasting of his second phase analysis. In 
his new analysis he made use of the fact that a Graeco-Latin square 
with both Latin and Greek letters representing treatments is a “partially 
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balanced incomplete block design with two associate classes and with 
grouped blocks orthogonal to treatments in two directions.” The first 
phase analysis now appears in the analysis as “between squares” and 
“treatments ignoring blocks.” The rest of the analysis is “grouped 
blocks,” “residual blocks eliminating treatments,” and “sub-plot 
error.’ This new analysis has removed the apparent ambiguity of 
certain degrees of freedom occurring twice, once in the first phase 
analysis and once in the second. McIntyre showed that it was quite 
possible to have more mean squares than variance components even 
with a very simple design and suggested that when this occurred maxi- 
mum likelihood methods should be used to estimate the components. 


TABLE 1 
HaMAKER’sS ANALYSIS 
Expected Value Mean Squares 
Source df. | MS. oy P. R. R. 

Between testsquares| 1 | 41.40 16 4 sf 
Treatments 3 | 97.23| 32 4 1 1 
Test plants within 

test squares 6 | 96.44 16 4 4 1 1 
Leaf positions with- 

in test squares 6 | 63.45 16 4 1 1 
Residual error for 

test squares 15 | 30.25 + 1 1 
Assay plants within 

assay squares 12 | 75.12 8 2 1 
Leaf positions with- 

in assay squares | 12 | 35.77 8 2 1 
Between leaveserror| 36 | 10.60 $ 1 
Within leaves error | 36 | 7.60 1 


Hamaker then suggested (private correspondence) that the method 
of analysis shown in Table 1 would be a further improvement on that 
proposed by McIntyre. The last four lines of the table occur in 
MclIntyre’s recast analysis but Hamaker has rearranged “between 
squares” and “treatments ignoring blocks” so that they are now the 
original analysis of the two first phase Latin squares. This allows the 
previous first phase F-tests of treatments and test plant leaf positions to 
be made. By working up the table line by line the variance components 
can be estimated in a unique way. This method of estimation makes 
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no use of the first line of the table, representing “between test squares.” 

Even this analysis is not completely satisfactory, and further 
modifications can usefully be made. Before detailing these, a slight 
error in McIntyre’s analysis should be pointed out. The treatment 
F-value is not significant, the 5% level for 3 and 15 degrees of freedom 
being 3.29. 


The New Analysis of Variance 


MclIntyre’s description of the experiment suggests that a reasonable 
model for the square root of the count on each assay plant half-leaf 
would be the sum of various effects, some fixed and some random. The 
treatment effects and both the test and assay plant leaf position effects 
will be regarded as fixed. The variation among the test and assay 
plants and the residual variation among test plant leaves and assay 
plant leaves and half-leaves will be regarded as random. This means 
that, with MclIntyre’s notation, only o%.p. , o,.p. » TL, oA , and o% 
are true variance components. However, for the fixed effects we shall 
retain McIntyre’s convention by writing o7 , for example, for 


d 
3 2" (V, vy, 
where V,, V,, V. , and V, are the four treatment constants and V 
is their mean. 

The grouping of the eight test plants into the two sets of four was 
random. However, the subsequent flotation treatment of these two _ 
sets might have introduced differences between them. We shall there- 
fore write o%.s. for $(S, — S,)*, where S, and S, are the two fixed 
Latin square effects. Similarly the timing of the inoculation of the 
various assay plants may introduce fixed Graeco-Latin square effects 
and the variation among these effects will be represented by 04.5. - 
Clearly the model could be complicated further by the inclusion of 
various interaction effects. From the description of the experiment, 
the most likely interactions would seem to be those of both treatments 
and test plant leaf positions with Latin sets and of assay plant leaf 
positions with Graeco-Latin squares. In the analysis that follows, mean 
squares for these interactions will be extracted and compared with 
appropriate error mean squares to see if there is any obvious discrepancy 
from the assumed model. 

The difficulties encountered in the analysis of this experiment arise 
from the correlation between the two half-leaves of each whole leaf on 
the assay plant. The analysis according to the first phase design does 
not make full use of this correlation, which vanishes only if os = 0. 
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The only analysis that does is the separate analysis of the second phase 
sums and differences followed by a suitable combination of the tests 
and estimates derived from each of them. This does not mean that 
the analysis according to the first phase design is invalid, but only that 
it is inefficient. It could well be used as a preliminary analysis to 
investigate, for example, the adequacy of the assumed model. This 
was done and the 6 degrees of freedom for test plant leaf positions within 
sets were separated into 3 for leaf positions averaged over the two sets 
and 3 for the interaction of leaf positions and sets. The interaction 
mean square was larger, although not significantly larger, than the 
error mean square. This suggests that in a more detailed analysis of 
this experiment the possibility of an interaction between test plant leaf 
positions and Latin sets should not be ignored. The effects of this 
interaction will be assumed to be fixed rather than random and the 
variation between the fixed effects will be represented by o; . The 
interaction of treatments and sets mean square was also extracted but 
was found to be no larger than the error mean square. 

MclIntyre’s original analysis of variance table is confusing because 
in none of its three sections has he listed the sums of squares nor has he 
shown that they and their corresponding degrees of freedom add to 
give lines representing the total variation. He has also omitted from 
the analysis of the sums the three degrees of freedom for Graeco-Latin 
squares. A complete analysis of variance table provides a check on the 
choice of the sources of variation and also on the computing itself. 
In the proposed analysis of variance, presented in Table 2, all the 127 
degrees of freedom are accounted for and they, and the various sums 
of squares corresponding to them, are seen to add to give the two lines 
representing the total variation of the sums and differences. 

The 12 degrees of freedom for leaf positions on assay plants can be 
formally analysed as 3 for positions averaged over squares and 9 for the 
interaction of leaf positions and squares. This interaction is of the 
character of a treatment and block interaction; a priort there was no 
reason to expect it to exist, and the finding that the mean square asso- 
ciated with it was smaller than the residual error mean square for 
“sums” with 12 degrees of freedom justified the prior choice of a model 
in which no component for the interaction appears and in which the 
mean square for error has 9 + 12 = 21 degrees of freedom. MclIntyre 
extends Yates’s method of ‘sums and differences” to analyse the 
Graeco-Latin squares. However, in extending the method to more 
than one Graeco-Latin square he does not separate the sums of squares 
representing effects averaged over the squares and the interaction of 
these effects with the squares. These last interactions can be used to 
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estimate a residual mean square. Further, Yates’s method is appropriate 
only when the Latin and Greek letters represent two quite distinct 
sets of treatments. In this experiment the Latin and Greek letters are 
linked by their association with treatments and test plant leaf positions. 
The 24 degrees of freedom in the “sums” analysis for Latin and Greek 
letters within squares can therefore be partitioned as shown in Table 3. 


TABLE 3 
ParTITION OF 24 DEGREES OF FREEDOM IN ‘Sums’? ANALYSIS FOR LATIN 
AND GREEK LETTERS WITHIN SQUARES 


Source dfs Expected Value Mean Squares 
Treatments 3 [1607 + 207 + 202 + 0? 
Leaf positions (T.P.) 3 160%. p. 4202 +202 +0? 
Leaf positions (T.P.) X Sets 3 1607 + 207 + 2¢f +0? 
Residual (1) 15 207 +207 + 0? 


Latin and Greek letters within 
squares 24 |207 + 20% + 207 + 207 + 20f +07 


The method of obtaining the various sums of squares corresponding to 
treatments, leaf positions (T.P.), and leaf positions (T.P.) X sets is 
demonstrated in Table 4. To illustrate the construction of this table the 
first entry is 175.5 = (24.9 + 18.5) + (23.1 + 24.3) + (27.7 + 23.7) + 
(15.1 + 18.2), i.e. it is the total count over all assay plant whole-leaves 
which have virus from leaf 1 of the first phase on either of their half- 
leaves. The number in brackets is the first phase leaf source and it 
therefore also indicates the Graeco-Latin square involved. The sum 
of squares for treatments is 


(1127.4)? + (1227.4)? + (1288.7)? + (1250.5)° (4894.0)’ 
64 256’ 


the sum of squares for leaf positions (T.P.) is 


(649.7 + 577.5)” + (567.4 + 586.3)’ 
64 


(606.8 + 653.2)” + (623.1 + 630.0)? (4894.0)? 
+ 
64 256 


and the sum of squares for leaf positions (T.P.) X sets is 
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(649.7 — 577.5)? + (567.4 — 586.3)’ 
64 


(606.8 — 653.2) + (623.1 — 630.0)’ 
64 


The sum of squares for residual (1) is obtained by subtraction. 


TABLE 4 
METHOD OF OBTAINING SuMS OF SQUARES FOR TREATMENTS, 
Lear Positions (T.P.) anp Lear Positions (T.P.) 
X Sets (“Sums” ANALyYsIs) 


Latin | Test Plant Treatment 
Set |Leaf Position A B Cc D Total 
1 1 (1) 175.5 (5) 197.9 (9) 152.2 (13) 124.1 | 649.7 
2 (6) 123.2 (2) 150.1 (14) 145.9 (10) 148.2 | 567.4 
3 (11) 140.6 (15) 115.8 (3) 176.3 (7) 174.1 | 606.8 
4 (16) 120.4 (12) 163.8 (8) 175.1 (4) 163.8 | 623.1 
Total 559.7 627.6 649.5 610.2 
2 1 (17) 141.0 (21) 144.8 (25) 164.1 (29) 127.6 | 577.5 
2 (26) 122.9 (30) 118.0 (18) 169.1 (22) 176.3 | 586.3 
3 (31) 127.0 (27) 167.6 (23) 172.4 (19) 186.2 | 653.2 
4 (24) 176.8 (20) 169.4 (32) 133.6 (28) 150.2 | 630.0 
Total 567.7 599.8 639.2 640.3 
Total over both sets 1127.4 1227.4 1288.7 1250.5 |4894.0 


The sum of squares for Latin and Greek letters within squares is 
made up of the sums of squares within the eight sets of totals in Table 4 
corresponding to the first phase leaf numbers 


1,2,3,4; 5,6,7,8; --- 5; 29,30, 31, 32. 


As each entry in Table 4 has been totalled over the four assay plants 
and assay plant leaf positions for one or other of the Graeco-Latin 
squares and as the variation being considered is within Graeco-Latin 
squares, only the terms o/ , o7.x. , 97, 91, 4 , and o? can occur in the 
expected values of the mean squares of Table 3. The four treatment 
totals have been summed over all four test plant leaf positions in both 
first phase sets and therefore the expected value of the treatments 
mean square does not contain o}.,. or o; . Similarly the eight test 
plant leaf position totals have been summed over all four treatments so 
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that the expected values of the mean squares for leaf positions (T.P.) 
and leaf positions (T.P.) X sets have no terms in oj. The coefficients 
in the expected values of the mean squares can be obtained in the 
usual way by writing out the composition of each total in detail, squaring, 
and then taking expectations. 

In the analysis of the differences the 4 degrees of freedom for alphabets 
have been split into 1 for Latin sets and 3 for Graeco-Latin squares. 
The Latin and Greek letters within squares sum of squares has been 
split exactly as in the “sums” analysis except that the Latin set 2 differ- 
ences are subtracted from, instead of added to, the Latin set 1 differences 
in arriving at the four treatment totals and the four leaf position (T.P.) 
totals. Both of these sets of four totals add to zero, so that no adjust- 
ment for the mean is necessary in calculating sums of squares. An 
adjustment for the mean is necessary for the leaf positions (T.P.) X sets 
sum of squares which is based on the sum of the two corresponding leaf 
position (T.P.) differences, one from each of the two Latin sets: The 
expected values of the mean squares are as in Table 3 except for the 
omission of the oj terms. Test plants are confounded with Graeco- 


Latin squares in the “sums” analysis but not in the “differences” 
analysis. 


Interpretation of the Analysis 


As in McIntyre’s analysis the differences between assay plants and 
between the leaf positions on the assay plants (now averaged over 
squares) were both significant. The treatment effects and the leaf 
position (T.P.) effects can be tested against an appropriate residual 
mean square in both the “‘sums” analysis and the “differences” analysis. 
They were not significant in either of them. The leaf positions (T.P.) X 
sets interaction can also be tested in both analyses; it was significant 
at 5% in the “differences” analysis but was not significant in the ‘‘sums”’ 
analysis. The residual Latin square variation, represented by o[ , 
can now be tested; it was significant in the “sums” analysis but not in 
the “differences” analysis. It may be possible to combine the results 
of the significance tests from the two halves of the analysis using the 
method recently proposed by M. Zelen [Zelen, 1957]. 

A mean square was extracted from both analyses for the interaction 
of treatments and Latin sets. Just as in the preliminary analysis of the 


first phase there was no indication that any allowance need be made 
for this interaction. 


Estimation of treatment effects 


The main interest must lie in the estimation of the treatment effects. 
In the “sums” and “differences” analyses shown in Table 2 there are 
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two mean squares for treatments each with 3 d.f., neither of which is 
significant. There exist estimates of the treatment effects corresponding 
to each of these two mean squares. Apart from being correlated through 
the errors in the first phase, these two sets of estimates are very similar 
to the “inter-block” and “‘intra-block”’ treatment estimates that arise 
in the analysis of an incomplete block design. The two sets of estimates 
may well be of different accuracies because one is based on sums and 
the other on differences. McIntyre uses the Latin square analysis to 
estimate treatment effects. This is equivalent to taking the unweighted 
mean of the two sets. If the accuracies of the two sets are different 
and if these accuracies can be estimated without too much error then 
it would be better to weight the estimates according to their accuracies. 
This is all slightly complicated by the correlation through the first phase 
error, but an estimate with minimum variance can still be derived. 
Table 5 shows the treatment means (in half-leaf units) for both the 
“sums” and the “differences” analyses and also for McIntyre’s analysis 
together with their respective standard errors. These standard errors 


are only appropriate for contrasts between the treatment estimates as 
the estimates are correlated. 


TABLE 5 
TREATMENT MEANS AND STANDARD EgRors (HAtF-LEaF UNits) 


Treatment Mean Variance of Difference 


Between Any Two 


Estimated S.E. of Diff- 
erence Between Any 


Analysis a b c d Treatment Means /|Two Treatment Means 
Sums 16.11 19.24 21.15 19.96) 3(207 + 202 + o?) 1.59 
Differences}17.30 18.76 20.04 20.37) 3(207 +2) 1.49 
McIntyre |16.71 19.00 20.59 20.17] + +0?) 1.35 


(Each standard error has 15 degrees of freedom) 


Let S and D denote the estimates of the difference between the 
effects of any two treatments from the “‘sums”’ and ‘‘differences” analysis 
respectively. Then 


V(S) = (207, + 204 + o°)/8, (1) 
V(D) = (201 + «9/8. (2) 
Also, from the last line of the above table, 
= ris) + + 2008, 


= (40, + o4 + 0/16, 
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eliminating V(S) and V(D) from (8) using (1) and (2), the covariance is 
C(S, D) = (207)/8. (4) 


Now the general weighted mean of S and D estimating the difference 
between any two particular treatment effects can be written aS + 
(1 — a)D and 

V[aS + (1 — a)D] = a’ V(S) + 2a(l — a)C(S, D) + (1 — a)’ V(D), 
which is minimized by choosing 


VD-CS,D 
V(S) + V(D) — 2C(S, D) + 


Therefore the estimate with least variance is 


oS + (204 + 
+ 


Qa 


Its variance is 

4 + oe) 

4 16(o4 + 
The efficiency of McIntyre’s estimate relative to this ‘‘best” estimate is 
+ 


2 2 4 


+ 
4o, tog tor (40, + + + 


If both of/o2 and oj/o7 are large and it is possible to estimate 
the relevant variance components with reasonable precision, then it may 
be well worthwhile to use the weighted mean. In this example, o3/o% 
and o;/o; are both small, and the estimated standard error of the ‘‘best’’ 
estimate, 1.35, is identical with the standard error of MclIntyre’s 
estimates. In other words, in this example the estimated efficiency of 
MclIntyre’s method of estimation is nearly 100%. 

If the variance components, needed to weight the two sets of esti- 
mates, cannot be estimated with sufficient precision then the weights 
will have to be chosen quite independently of the results of the analysis. 
MclIntyre’s method, which is equivalent to choosing equal weights for 
the two sets, is the best when oj can be assumed to be small compared 
with of . If, on the other hand, cf can be assumed to be much larger 
than o% , it would be better to give zero weight to the ‘‘sums” estimates 
and use only the “differences” estimates. This is obviously equivalent 
to ignoring ‘‘inter-whole leaf” information. 
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Estimation of Variance Components 


Table 6 shows the estimates of the variance components which can 
be obtained by equating the mean squares to their expected values. 
MclIntyre’s estimates, using his revised values for of and a? , are also 
shown. 


TABLE 6 
ESTIMATION OF VARIANCE COMPONENTS 
Variance Component | Sums Analysis | Differences Analysis | McIntyre’s Analysis 
7.60 7.63 
o2 1.28 a 2.21 
3.40 2.96 
ae 8.12 — 7.88 
2.72 ome 
of 10.48 1.22 5.10 
2 

0.58 2.08 

0.36 0.83 
on 0. 49 


The information on o7 contained in the two halves of the analysis 
can be combined in a way analogous to that used in combining the 
information on the treatment effects. The estimate of o7 from the 
sums analysis can be written 


(oot + 20% + g?) — (20% | 


and that from the differences analysis 


where all four x’ variates are independent and x7,;, , for example, 
represents a x’ variate with 15 degrees of freedom. With at least 15 
degrees of freedom for each x’, it seems reasonable to estimate o7 by 
an expression of the form 


61, = aS; + (1 — 


in which a is chosen to minimize the variance of ¢7 . Sj and D? are 
uncorrelated so that the variance of ¢7 is minimized by weighting S7 
and D/ inversely as their variances. Now 
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+ + 0%)? 2(20% + 0%)” 
7 2 a A € 
4V(S7) 


and 


2(207, + 4 2(a%)” 
15 36 


4V(Di) = 


and so using the estimates of = 7.60, 0{ = 1.28, 07 = 3(10.48 + 1.22) = 
5.85, we have 


4V(Si) = 73.54 and 4V(D{) = 52.88. 
Therefore, we have approximately 


OL 12 aad 5.08. 


The best way to combine the two estimates of o}.,. and o; is not 
so obvious. There is clearly also some extra information on oj to be 
obtained by differencing the two leaf position (T.P.), the two leaf 
positions (T.P.) X sets, and the two treatment mean squares, which 
occur in the analysis of Table 2. A complete maximum likelihood 
solution for the estimates of the various variance components would 
probably be rather difficult. Perhaps the most practical method would 
be some form of weighted least squares solution of the equations obtained 
by equating the mean squares to their expected values. As the weights 
needed to do this should involve the variance components themselves, 
some process of iteration would be required. Whatever method is 
used, the estimates of many of the variance components are likely to be 
subject to considerable error. As McIntyre points out, because of these 
errors of estimation the estimated values of the variance components 
can be used only for illustrative purposes; his discussion of the relative 
efficiency of alternative designs should therefore be treated with caution. 
A considerable amount of replication over all the random effects is 
needed if the analysis of a single experiment is to be used to estimate 
the effect of modifications in the experimental design. 

For the example given by McIntyre, the new method of analysis does 
not increase the efficiency of estimation of the treatment effects, nor 
does it provide very different estimates of the variance components. 
In any experiment in which the correlation between the half-leaves of 
the assay plants and the ratio of the second phase to first phase whole 
leaf errors are both greater, the more efficient analysis may represent a 
considerable gain. In any event, the author considers the method of 
analysis he has described to be not very much more laborious computa- 
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tionally than that used by McIntyre and to be easily understandable 
by analogy with more familiar types of design. 
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ANALYSIS OF QUADRUPLE RECTANGULAR LATTICE 
DESIGNS* 


JouN Leroy Foiks 


Towa State College 
Ames, Iowa, U.S.A. 


1. Introduction. The rectangular lattice designs are very useful in 
the comparison of treatments because the number of treatments that 
can be accommodated is k(k — 1) which is intermediate between k 
and (k — 1)? for which numbers of treatments ordinary lattice designs 
are available. The analyses of simple and triple rectangular lattices 
are contained in the literature (Harshbarger,, Cochran and Cox,’ 
Kempthorne,’ and Robinson and Watson‘). It is desirable in some 
cases to use four replicates and hence four different confoundings rather 
than to repeat a simple lattice design with two different confoundings, 
so that the analysis of quadruple rectangular lattice designs needs to 
be recorded for the use of experimenters. 

The design for k(k — 1) treatments in blocks of (k — 1) plots may be 
constructed by making use of a Graeco-Latin square. Ak X k Graeco- 
Latin square is constructed in which every letter of both languages 
appears in the leading diagonal. This can always be done when k is a 
prime or a power of a prime. The leading diagonal is deleted and the 
four basic replicates are constructed by confounding treatments with 
rows, columns, language 1, and language 2, respectively. The four 
basic replicates may each be replicated r times. 

The allocation to blocks of groups of treatments which are to be 
together in a block should be made randomly and the treatments should 
then be randomized within each block. 

2. Estimates of Treatment Differences. There are four basic replicates 
and we shall denote them by X, Y, Z, and V, according to the following 
confounding: 

X Rows of Graeco-Latin square confounded, 

Y Columns of Graeco-Latin square confounded, 

Z First language of Graeco-Latin square confounded, 


*Journal Paper No. J-3465 of the Iowa Agricultural and Home Economics Experiment Station, 
Ames, Iowa. Project 890. 
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V Second language of Graeco-Latin square confounded. 

It is convenient to designate a treatment by four subscripts (ghij) 
which denote the row, column, first language letter, and the second 
language letter. The true additive effect of treatment (ghij) will be 
denoted by 7,,:;; . We denote the yield in the different replicates as 
follows: X replicate by: x,:;; , Y replicate by: y,.;; , Z replicate by: 
aNd V replicate by: v,,;; 


Let 
a, = 
ghii » 
(1) 
and 


1 
6; = Tohii 


where g © h,g i,g j,h h j, t ¥ j over all summations. 
Such inequalities hold for summations throughout the paper. 

The derivation is a simple extension of that by Kempthorne and his 
notation is used exclusively so that only an outline is given. 

Thus with the assumption of additivity of treatment effects and the 
actual randomization, it follows that the best linear unbiased estimates 
of the 7,,;;’8 are obtained by minimizing the following quantity: 


[> Grasse Toms + a,)” 


s ghii 


ghii 


ghii 


ghii 


(k — 


4 
a 
{ 
a 
| 
Ale 
er 
if 
i 
| 
! 
f 
a 
2) 
4 
3 
| 


76 BIOMETRICS, MARCH 1959 


This yields the estimate of 


» Lmnpa. + Ymnpe. + + Umape. — y 


+ [dem + + d,, + (3) 


where 


and 


In other words, the estimate of tn.», is given by the usual estimate 
plus four corrections. 


Computationally, it is easier to work in terms of adjusted treatment 
totals. We find the adjustments to the observed total yields to be: 


— W’) dn/W = (W — — 4Z...... 


— — + Te... — 0... 
— DEW + W’) W’)) 

with similar expressions for 


r(W — W’) 
W 


(5) 


— W’) d 


2p) 


7 
ond 
where 7’,.... equals total over the whole experiment of treatments 
appearing in block 7 of the X replicates, X;.... equals the total over the X 
replicates of the z-th block totals, etc. The first term in expression (5) 
contains a term which makes no contribution to the whole adjustment 
and may be deleted. The term is 


However, we use expression (5) in deriving the analysis of variance and 
for that purpose, the term should not be omitted. 


3 
= 1/0? + (k — 1)02) 
| 
| 
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Let 
Ci. = T,.... 44,....5 Cam Pa... 4¥ 2... , 
and 
8. = + Cyn + Con + Con - 
The adjustment to the observed total yield is 
(ACL, — wS,) + — + — + — 
where 


r 


3. The Analysis of Variance. The analysis of variance has the 
general structure shown in Table 1. 


TABLE 1 
ANALYSIS OF VARIANCE WITH r REPETITIONS OF QUADRUPLE RECTANGULAR 
LatTTICE 
Source dfs Mean Expectation 
Square of Mean Square 
Replicates 4r —1 
Blocks Component a |4(r — 1)(k — 1) B 4r—1 = 
Blocks Component b |4(k — 1) — 
Treatments k(k—1)-1 
(Ignoring Blocks) 
Error (8r-—1)k +1] o 
Total 4rk(k —1)—1 


Blocks component a is the interaction of blocks and replicates 
with the same block arrangement. Blocks component b is found by 
subtracting blocks component a from the sum of squares for blocks 
within replicates eliminating treatments. The latter is given by 


block total—sum of treatment means 
>> d,,. | over whole experiment of treatments 
occurring in the block. 


plus similar terms for d,,, , d,;, ,and d,;, where ~ denotes the intrablock 
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estimates of the block parameters. The intrablock estimates are given 
by such terms as 


From equation (4) 


where d,, is found by letting W’ = 0 in equation (5). Using equations 
(6), we see that 


1 4 
S, 


3k — 4 3k(8k — 4) 


The sum of squares for blocks eliminating treatments is given by 


analagous terms 
which equals block component a plus 


analagous terms 
4r + Y, Z, and V 


Cre 
rd, = 


That is, block component b is equal to 


1 4 
r(3k — 4) 4) 


plus analagous terms so that it is seen that the sum of squares for 
component 6 is given by 


The weights are estimated by 
w=1/E, w’ = (4r — 1)/(4rB BE). 
4. The Variance of Treatment Differences. To obtain the variance 


Of — We follow the procedure outlined by Kempthorne. 
We find this variance to be equal to 


1 2 2 2 2 
4r(3k 4) (Crs + + + 
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2 W-W') 2 
2B 
+ (1 + (1 (A — B)(A 3B) {a (9) 


+ (1 Bn’) + (1 5 yp’) + (1 


where 
A=rk-—1)(8W+W’) and B=r(W- W’). 
Making use of equation (9), we get the following variances: 


The 2 Treatments Occurring Together in a Block 
1 subscript alike: E (9k — 3)W (3k — 9)W ] 


E 4 (9k 2)W + (3k ow | 
2 subscripts alike: 4rW 


(10) 
(9k — 1)W + (3k ww" | 
3 subscripts alike: E A 
(QkW) + (3k — |. 
4 subscripts alike: rw E + A 
The 2 Treatments Not Occurring Together in a Block 
0 subscripts alike: E (12k — + | 
4rW 
2 subscripts alike: E + (12k — 2)W + (11) 
4rW 
(12k — 1)W + (4k — | 
3 subscripts alike: 
4 subscripts alike: + 
where 
r(W — W’) 


(8k — + + (k — 4)W’) 


W-W’ 
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The mean variance may be obtained by noting the frequency with 
which each of the nine possible variances occurs and then computing 
the weighted average. For k = 4, all pairs of treatments have 4 letters 
in common. 

An approximate average variance is computed by following the usual 
procedure of assuming that 4(k — 2) of the comparisons have variance 


@ E 4 (9k 3)W + Bk — | 
4rW A 


and that the remainder have variance 


E 4 (12k — 4)W + (4k — 
4rW A 


This gives an approximate average variance of 


4rW (ke 


5. Efficiency Considerations. In judging the efficiency of the design, 
it may be of interest to make several comparisons. 
Comparisons Utilizing Interblock Information 
(1) Ratio of mean variance of quadruple rectangular lattice to 
that of a complete randomized block design. 
(2) Ratio of mean variance of quadruple rectangular lattice to that 
of a simple rectangular lattice replicated twice. 
Comparisons Utilizing Only Intrablock Information 
(3) Intrablock Relative Efficiency of Quadruple Lattice. Ratio of 
mean variance of complete block design to that of a quadruple 
rectangular lattice. 
(4) Efficiency Factor of Quadruple Lattice 


(12) 


Relative Efficiency ; 
W (Error variance of complete block design) 


It may also be meaningful to compare these quantities with the efficiency 
factor and relative efficiency of the simple rectangular lattice replicated 
twice. Usually not all these comparisons are made. However, we feel 
that the question of efficiency cannot be resolved by use of a single 
criterion. 

6. Numerical Example. A quadruple rectangular lattice design was 
constructed by making use of the following Graeco-Latin square. 

The data in the following table are artificial data. They were 
obtained by adding block effects to uniformity trial data. Some real 
data were available for use as an example. However, the adjustments 
were too small to make the example instructive. 


a 
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1 2 3 4 
4 2 
4 J\1 3 
3 
2 4 1 
4 
3 1 2 WA 
2 3 
TABLE 2 
YIELDS 
X Replicate 
Block 
1 16.8 (1234) 15.7 (1342) 17.4 (1423) 
2 15.3 (2143) 14.1 (2314) 16.5 (2431) 
3 14.1 (3124) 14.3 (3241) 15.6 (3412) 
4 13.9 (4132) 13.9 (4213) 16.0 (4321) 
Y Replicate 
1 16.4 (2143) 15.0 (3124) 17.5 (4132) 
2 14.8 (1234) 17.2 (3241) 17.8 (4213) 
3 17.7 (1342) 18.0 (2314) 16.8 (4321) 
4 15.2 (1423) 13.2 (2431) 14.1(3412) 
Z Replicate 
3 14.6 (1234) 16.7 (2431) 17.3 (4132) 
4 14.9(1342)  16.6(2143)  14.2(3241) 
2 16.2 (1423) 19.4 (3124) 16.1 (4321) 
1 18.7 (2314) 17.3 (3412) 15.8 (4213) 
V Replicate 
4 17.1 (1234) 17.8 (2314) 17.2 (3124) 
2 17.9 (1342) 17.3 (3412) 16.4 (4132) 
3 16.5 (1423) 13.5 (2143) 16.7 (4213) 
1 16.7 (2431) 18.4 (3241) 17.8 (4821) 
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In the computing which follows, r = 1. 
a. Complete the following tables. 


TABLE 3 
TOTALS 


t Xi. Vis 
1 49.9 48.9 51.8 52.9 
2 45.9 49.8 51.7 51.6 
is 3 44.0 52.5 48.6 46.7 
4 43.8 42.5 45.7 52.1 
Total | 183.6 193.7 197.8 203.3 778.4 
TABLE 4 
TREATMENT TOTALS 
ghij 1234 1342 1423 2143 2314 2431 
63.3 66.2 65.3 61.8 68.6 63.1 
3124 3241 3421 4132 4213 4321 
65.7 64.1 64.3 65.1 64.2 66.7 
TABLE 5 
CoMPUTATION OF C’s AND S’s* 
7 1 | 194.8 192.6 197.1 193.9 | —4.8 —3.0 —10.1 —17.7 | —35.6 
2 | 193.5 191.6 197.7 195.6 9.9 —7.6 —9.1 —10.8 | —17.6 
3 194.1 201.5 191.5 191.3 18.1 —8.5 —2.9 4.5 11.2 
4 196.0 192.7 192.1 197.6 20.8 22.7 9.3 —10.8 42.0 
Total | 778.4 778.4 778.4 778.4 44.0 3.6 —12.8 —34.8 0 
*The formulas for the C’s and for S are given by equations (6). 
b. Obtain the analysis of variance. Blocks component b sum of 
squares is obtained by using equation (8). 
1 
w= = 0.5124 
1.9515 0.5124, 
3 
WwW = => 0.2679. 
7 4(3.2872) — 1.9515 


: 
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TABLE 6 
ANALYSIS OF VARIANCE 
Due to d.f. | Sum of Squares | Mean Square 
Replicates 3 17.3117 5.7706 
Blocks Componenta) O | 
Blocks Component 12 39.4469 3.2872 
Treatments 11 9.0867 0.8261 
Error 21 40.9814 1.9515 
Total 47 106.8267 


c. The adjustments for the treatment totals are found by using 
equations (7). 


_ (0.5124) — (0.2679) 
~ 8(0.5124) + 4(0.2679) 


(0.5124) — (0.2679) 
12(0.5124) 


0.00188. 
The adjustments for the treatment totals are given by: 


= 0.04728, 


TABLE 7 
ADJUSTMENTS FOR TREATMENT TOTALS 
g h 7 j 
1 | —0.16 -—0.07 -—0.41 —0.77 
2 0.50 —0.33 —0.40 —0.48 
3 0.83 -—0.42 —0.16 0.19 
4 0.90 0.99 0.36 —0.59 


d. Using sets of equations (10) and (11) compute the proper vari- 
ances. 


[8(0.5124) 5(0.2679)][12(0.5124)] _ 
(0.5124) — (0.2679) = 196.796. 


Variance for Treatments Occurring Together in a Block 


2 E 4, 36(0.5124) 
4(0.5124) 136.775 


A 


| = 1.1074. 
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Variance for Treatments Not Occurring Together in a Block 


4(0.5124) 136.775 | 11512. 


Mean Variance 


8(1.1074) + 3(1.1512) 
il 


= 1.1193. 


Utilizing intrablock information only, the variances for the quadruple 
lattice are: 
For Treatments Occurring Together in a Block 


36 
For Treatments Not Occurring Together in a Block 
48 | 
40.5124 | + Gaye] = 1-467. 
Mean Intrablock Variance 
8(1.3417 + 3(1.4637) — 1.3750. 


11 


e. Using formulas given by Kempthorne, compute the proper vari- 
ances for a simple rectangular lattice replicated twice. 
Variances for Treatments Occurring Together in a Block 


1.0788. 
Variances for Treatments Not Occurring Together in a Block 
0 subscripts alike 1.1819 
1 subscript alike 1.1926 
2 subscripts alike 1.2035. 
Mean Variance 
4(1.0788) , 2(1.1819) , 4(1.1926) (1.2035) _ 


Utilizing intrablock information only, the variances are: 
For Treatments Occurring Together in a Block 


1.3417. 
For Treatments Not Occurring Together in a Block 


‘ig 
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0 subscripts alike 1.7077 
1 subscript alike 1.8296 
2 subscripts alike 1.9516. 


Mean Intrablock Variance 


1.6411. 


Note that the more nearly complete balance of the quadruple rectangular 
lattice is reflected in lower differences of the variances than for the 
simple rectangular lattice. 

f. Estimate the error mean square had complete randomized blocks 
been used. 


_ 39.4469 + 40.9814 


33 


= 2.4372. 


sO 


2E’ 
4dr 


= 1.2186. 


g. Make efficiency comparisons. 
Comparisons Utilizing Interblock Information 

(1) Ratio of mean variance of quadruple lattice to that of a complete 
blocks arrangement. 


1.1193 
1.2186 


= 0.9185. 


(2) Ratio of mean variance of quadruple lattice to that of a simple 
lattice replicated twice. 


1.1193 
1.1503 


= 0.9731. 


The quadruple lattice seems to be slightly more efficient than either 
of the other two designs. 
Comparisons Utilizing Intrablock Information Only 

(3) Intrablock relative efficiency of quadruple lattice 


1.2186 


1.3750 = 0.8863. 


(4) Efficiency factor of quadruple lattice 


0.8863 _-0.8863 


(0.5124)(2.4372) ~ 1.2498 ~ 
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(5) Intrablock relative efficiency of simple lattice replicated twice 


1.2186 
1.6411 


= 0.7426. 


(6) Efficiency factor of simple lattice replicated twice 


0.7426 
1.2488 


= 0.5946. 


Conclusion: 


The purpose of this note was to set out for experimenters the com- 
putations involved in analyzing a quadruple rectangular lattice. The 
quadruple rectangular lattice is to be preferred over the simple rec- 
tangular lattice on two grounds: (1) that the estimates of treatment 
differences will have lower variance and (2) that the quadruple lattice 
is more nearly balanced so that there are fewer risks involved in ignoring 
the incomplete block structure when one partitions the treatment sum 
of squares as one frequently wishes to do in genetic and plant breeding 
studies. 
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AN EXAMINATION OF SOME METHODS OF COMPARING 
SEVERAL RATES OR PROPORTIONS 


MInNDEL C. SHEPS 


Department of Preventive Medicine, Harvard University Medical School 
Boston, Massachusetts, U.S.A. 


The investigation of such problems as the association between 
smoking and lung cancer, between congenital malformations and expo- 
sure to radiation, or between coronary disease and obesity, usually re- 
quires quantitative estimates of the increase in the “‘risk”’ of the condition 
under study (Sheps [1958]). Several rather different methods of meas- 
uring changed risks have been used by investigators. 

In the field trials of poliomyelitis vaccine (Francis [1955]), for ex- 
ample, the incidence of paralytic poliomyelitis per 100,000 was 16 (J.) 
among vaccinated children and 57 (J,) among those who had received 
placebo injections. The effectiveness of the vaccine was estimated as 
100(1 — J,/I,) = 100(1 — 16/57) = 71.9 per cent protection. Ham- 
mond and Horn [1954] compared death rates of smokers (D,) with the 
mortality rates observed in their sample of nonsmokers (D,), by deriving 
the ratio D,/D, . Cornfield developed a method (Cornfield [1956], 
Neyman [1955]) for estimating an analogous “relative risk” from 
retrospective data on the smoking history of men with and without 
cancer. He defined relative risk as C,/C) where C, and C, represent 
the prevalence of lung cancer among smokers and nonsmokers respec- 
tively. 

On the other hand, Berkson [1958] estimated the effect of smoking 
on mortality as the difference between two mortality rates, i.e.,D, — Do. 
For example, the standardized mortality rates from lung cancer observed 
by Doll and Hill [1956] were 1.66 and 0.07 per 1000 for heavy smokers 
and nonsmokers respectively. Rather than consider this a “relative 
risk” of 1.66/0.07 = 23.7, Berkson calculated the additional rate of 
1.66 — 0.07 = 1.59 per 1000 smokers. He contended that his comparison 
was more appropriate since it treated the increased risk as proportional 
to those who would have survived if they had not smoked, rather than 
treating the risk as proportional to the deaths. 

As will be shown below, the meaning of the described statistics 
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depends on what model is assumed to underlie the observations. Even 
the two ratios J,/7, and D,/D, may be estimates of quite different 
parameters. In this paper it is intended to consider possible models 
for these problems, to examine ratios and differences between rates in 
the light of these models, and to suggest a general method of estimating 
the parameters postulated under a new model. 


I. COMPARISONS OF TWO SAMPLES 
Model 1 


We shallassume that the frequency of the occurrence of some event 
(death, poliomyelitis, coronary disease) is observed in a random sample 
of each of r populations. Let x; be the number of occurrences observed 
among the 7; individuals in the ith sample (¢ = 1, 2, --- r). The 
purpose of the investigation is to compare the risks experienced in the 
several populations. 

A common model in such comparisons of two samples postulates 
that p; , as estimated by x,/n; , is the risk experienced by the persons 
in the ith population. According to this model, it is perhaps merely a 
matter of opinion whether a more meaningful comparison of, say 7, 
and p, , is provided by A(p) = pz. — p, or by p(p) = p2/p,. The 
comparison p(p) has an intuitive appeal to many people since it seems 
reasonable to say “the risk is doubled,” or “halved.” However, there 
is a logical difficulty connected with the use of this ratio. If p; isa 
mortality rate and g; = 1 — p; is the survival rate, the choice between 
comparing mortality rates and comparing survival rates is often arbi- 
trary. Unfortunately the value of the ratio p(qg) = q2/q: has no predic- 
table relation to the value of p(p) = p/p, as may be seen from the 
following numerical example: 


.20 .60 .85 .70 
.10 .30 .95 .75 .90 
p(p) = po/pr 2.00 2.00 2.00 0.90 0.33 0.78 
eq) = qe/qu 0.99 0.89 0.57 3.00 3.00 3.00 


Algebraically, it is obvious that if p./p, = 0, q2/q. = (1 — @p,)/(1 — 
and depends greatly on the magnitude of p, (or q,). 

The value of a ratio of two rates used to measure relative risk 
therefore depends on which particular cells of a fourfold table are 
chosen for the comparison. In contrast, the difference between rates 
is not affected by the choice of cells, since A(q) = q@ — q1 = —A(p). 
The difference is easily interpreted as an estimate of the additional 
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number of people affected, and is used in the common statistical pro- 
cedures of testing the null hypothesis p, = p, , or of making interval 
estimates. Under Model 1, therefore, the difference between rates has 
some advantages over the ratio. 


Model 2 


a) For the Smoking Problem 


According to Model 1, the individual risks p; , experienced by the 
several populations are mutually independent. This seems to be a 
reasonable assumption for samples from essentially different populations, 
as when the comparison refers to the incidence of some disease in the 
two sexes, or in people who differ genetically, or when it refers to the 
results of treatment by quite different methods. But, if we assume 
that smokers are not selected on a factor related to lung cancer, another 
approach to the smoking problem is indicated. In this case, the popula- 
tion of smokers are nonsmokers whose reactions have presumably been 
modified as a result of smoking cigarettes. Without ever smoking a 
cigarette, some men (say po) will die of lung cancer. If those who would 
otherwise escape lung cancer become heavy smokers, some of them 
(say p,) may be added to the lung cancer deaths. Then, among n, 
smokers, pon, are expected to die of lung cancer regardless of cigarettes 
and p, of the remaining (1 — p.)n, men will also die of lung cancer. 
Therefore z,/n, , the proportion of smokers dying of lung cancer is 
the result of two probabilities: pp + p,(1 — po). The expected values 
of xo/n) and x,/n, are as shown in Table 1. 


TABLE 1 


Expectep Mortauity Rates rromM LuNG CaANcER AMONG 
SMOKERS AND NonsMoKERS (MopEt 2) 


Nonsmokers Smokers 
Number of men dying E(xo) = pono E(x.) = {po + (1 — po)pe}ns 
from lung cancer = (1 — qogs)ns 


Number of men not dy- |E(no — xo) = (1 — po)no|E(ns — 2s) = (1 — pol — ps)ns 
ing from lung cancer = = 

Total no Ns 

Mortality Rate E(Do) = E(x0/no) = = po + (1 — po)ps 
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There are various ways in which one could describe, under this 
model, the risk of lung cancer experienced by a smoker. The total 
annual mortality rate is p. + (1 — po)p,. Given that a smoker belongs 
to the category of persons who would otherwise escape lung cancer, he 
now suffers a risk, p, , that he will fail to do so. Also, while nonsmokers 
have a chance, go , of escaping cancer in a given year, for smokers this 
chance is decreased by a factor g, = 1 — p, . 

As is shown in a later section of this paper, the maximum likelihood 
estimates and the intuitive estimates of the individual parameters are 
identical. Applying the formulas to the previously quoted rates (Doll 
and Hill [1956]), D. = 0.00007 and D, = 0.00166, the estimates are: 


Po = = Dy = 0.00007, 
Go = (no — 2)/no = 1 — Dy = 0.99993, 


_ (n, — 2, % 


or (= / = 0.00159 or 1.59/1000. 


8 No 


We can now consider what, under this model, is being estimated by 
the methods previously described. ‘Relative risk,’ defined as the 
ratio of the two observed mortality rates, D,/D) has an expected value 
[po + (1 — po)p.l/po = 1 + (1 — po)p./po. Let us call a compound 
ratio of this type, which includes two probabilities, and is > 1, a type 
A ratio. Since its value is considerably affected by the magnitude of 
both pp, and 7, , it is difficult to interpret. For example, an attempt to 
evaluate the effect that smoking has on several different conditions 
with different values of p, , might be misleading if based on a comparison 
of the type A ratios. 

The ratio of “survival” rates (i.e. survival in terms of Table 1), 
(1 — D,)/(1 — Dp) is an estimate of a single probability, namely of q, -. 
We shall call this a type B ratio, which is always < 1. 

The difference, D, — D, , under this model, is @of, . Although 
this value is a product of two probabilities, it is equal to #, = 1.59/1000 
in this particular example, but only because @ is nearly equal to one. 


b) For the Vaccination Problem 


In the poliomyelitis example, there is no doubt that the two samples 
of children were originally from the same population, since the treat- 
ments were assigned at random in a well controlled experiment. The 
biological process is the reverse of that postulated in the smoking 
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example. If a proportion, p, , of unvaccinated children contract 
poliomyelitis and 1 — p, do not, vaccination will presumably not affect 
the status of the 1 — p, children. On the contrary, vaccination will 
protect a fraction, 1 — p, , of the children who would otherwise have 
the disease, and would decrease the number of cases of poliomyelitis 
by a factor p, . The composition of the expected incidence rates is 
shown in Table 2. 


TABLE 2 


EXxPeEcTED INCIDENCE RATES OF POLIOMYELITIS 
By Vaccination Stratus (MopeEt 2) 


Unvaccinated Vaccinated 


Children 
with polio Pipe 


Children 


without polio (1 — pilm {(1 — pi) + pill — pr)}me = (1 — pipe)ne 


Total 


nm Ne 


E(I2) = pips 


Incidence E(\h) =m 


In this case, the ratio of the incidence rates 16/57 = 0.281 is a 
type B ratio with an expected value of p.. In terms of this model the 
field trial measure of effectiveness was an estimate of g. = 1 — p,. 
This result differs structurally from the type A ratios of “relative risk”’ 
obtained from the smoking example. Table 2 is, in a sense, the mirror 
image of Table 1. If the z; and n; — 2x; were interchanged, and if the 
parameters were redefined so as to substitute the appropriate gq for 
each p and vice versa, Table 2 would be analogous to Table 1. The 
estimates are therefore also subject to the same interchange. 

The difference between the two incidence rates J, — I, = ,(1 — #2) 
or #142 yields a value of 0.00041 or 41/100,000. This value resembles 
neither #, , #2 , nor . 

Note: As a matter of fact, it is possible that vaccination might produce 
the disease in a proportion (p;) of the (1 — p,) children. The incidence 
rate of poliomyelitis among vaccinated children would then have the 
expected value: (1 — p,)p3 + pip2. However, the data available in most 
experiments of this type would not provide a basis for estimating p; . 


Vp 
| 
} 
{ 
= 
‘ 


92 BIOMETRICS, MARCH 1959 


c) General 


Model 2 is suggested as appropriate when the difference between two 
samples is believed to consist of a change of outcome in only one of the 
subgroups, i.e., either in those who would otherwise be in the x; category 
or in those who would otherwise not fall in this category. Where Model 2 
is used, estimates of the defined p,; or g; might precede other explicit 
comparisons of two risks. 

As a preliminary step, it is necessary to identify the type B ratio, 
which yields an estimate of one of the parameters in question. It is 
clear from the examples that a type B ratio results from comparing 
the proportions of individuals in the two samples who fall into the 
category which, by postulate, is subject to the changed risk. Generally 
if the factor being studied has a deleterious effect, a comparison of 
rates of survival or “success’’ yields an estimate of the decreased chance 
of escaping the risk. If a beneficial change is under study, the ratio 
of mortality or incidence rates yields an estimate of the decreased risk 
of acquiring the condition. 

Another type of example may help to illustrate the model. In an 
early clinical trial (Medical Research Council [1948]) of streptomycin 
therapy in tuberculosis, the effect of this antibiotic was compared with 
that of bed rest only. After six months of treatment, ‘‘considerable 
improvement”’ was observed in 8 per cent (#,) of the control group 
and in 51 per cent of the others. Streptomycin therapy would be 
expected to help only those who would not improve without it (¢, = 92 
per cent). Here 7, is defined not as a risk, but as the chance of improve- 
ment. The treatment given is expected to increase the chance of 
improvement. Therefore, the ratio of “failures,” 100(1 — .51)/(1 — .08) 
= 53 per cent is a type B ratio which gives an estimate of g, . Its 
complement, 47 per cent, might be used to measure the effectiveness of 
streptomycin. The compound estimate given by the type A ratio of 
the “relative chance of improvement” is 51/8 = 6.4. The difference 
(43 per cent) between the rates is equal to the product #.4, or 
(.47) (.92)100. 


d) Maximum Likelihood Estimates 


With the model as given in Table 1, assume that each sample is 
randomly drawn from its population, and that only the sample sizes 
mM, and n, are fixed. Then the logarithm (Z) of the likelihood function 
[Mood 1950] is: 

L = 2x log po + (Mo — Xo) log (1 — po) 
+ x, log {po + (1 — po)p.} + (. — x.) log (1 — po) (1) 
+ (n, — z,) log (1 — p,). 
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The first-order partial derivatives are: 


OL 2% — Xo x1 — p,) n, — 2, 
2 
OPo Po — Po)Ps (2) 


aL x,(1 — Po) — Us 


ap. 
Setting (2) and (3) equal to zero, and solving for po and p, we have: 


(3) 


po = Lo/No , (4) 
1 Lo/No (5) 


and 


The variance-covariance matrix of po and p, (Mood |1950]) is the 
inverse of the matrix with elements —E(d°L/dp;0p;) where 7, j = 0, s. 
These elements are: 


= — 7 
aps Poo 1 — qq. @) 
N.Qo 
op, — 8) 
aL n 
Op, 1 ( ) 
The variances and covariance of the estimates therefore are: 
2 _ 
2 _ (1 = Pott) 
op, qo ( nN, + No (1 1) 
= — Pods, (12) 
No 


Since the published data on the smoking studies consist of age- 
adjusted rates, we shall apply these formulas to another example. In 
the Framingham epidemiologic study of heart disease (Dawber [1957]), ' 
during a four-year period, new cases of arteriosclerotic heart disease 
(ASHD) where observed in 4.46 per cent of the 717 patients in the two 
lowest weight categories. The standard error of this rate calculated 
in the usual way, according to equation (10), is 0.77 per cent. 
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ASHD was observed in 11.36 per cent of the 176 patients in the two 
highest weight categories. Obesity would presumably act only on 
persons who are otherwise in the ‘“‘no ASHD” category. An estimate 
of q, , the relative decrease in freedom from ASHD which is associated 
with obesity, is obtained from the type B ratio of the ‘no ASHD” 
percentages, i.e., gf = (1 — .1136)/(1 — .0446) = 0.9278 or 92.78 
per cent. The risk associated with obesity is therefore (1 — 4,)100 = 
7.22 per cent. The standard error of the rates associated with obesity 
is, from equation (11): 


0.9278 {.1136 - 
100| 9:9228 { 176 + 717 = 2.61 per cent. 


The correlation coefficient (r) for the two sets of estimates in this example 
is from equations (10), (11), and (12): r = —(.0446)(.9278)/717(.0077) 
(.0261) = —0.289. 

In contrast with 7.22 per cent as the estimated risk of ASHD asso- 
ciated with obesity, it is of interest to note that the type A ratio of 
“relative risk’? would be 11.36/4.46 = 2.55. The difference between 
the incidence rates, 6.90 per cent is equivalent to #,(1 — fo) = 
(7.22) (.9554). 

The estimates of a beneficial effect follow by analogy from the 
equations given. With the symbols as defined for Table 2, the maximum 
likelihood estimates are: 


A 

= 2,/n, and fp, 
Ny { Ne + Ny and Ny 


II. COMPARISON OF r SAMPLES WHEN r > 2 


The approach just outlined leads to a variety of possible models for 
comparisons of three or more samples. The choice of an appropriate 
model for any problem would depend on the expected effects of the 
factors under consideration, as shown in the following examples. 

(i) Consider that the results in an untreated group of patients with 
a true mortality rate p, , are being compared with the results of (r — 1) 
different treatments. Each treatment may be assumed to exert its own 
modifying effect p; . The expected mortality rates would be: 

Among untreated patients: —Pr 
Among patients with treatment 7: p,p; 
Among patients with treatment j: pp; . 
The estimates would each be derived as in Model 2 and have anal- 
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ogous variances. The only added feature is that #; and #,(%, 7 # 1) 
are correlated, with a covariance q,p,p;/pini - 

(ii) When the individuals in the several samples are exposed to 
graduated amounts of the same factor, e.g. to no cigarettes, to less than 
one pack daily, or to more than one pack daily, successive decrements 
to the survival rate might be assumed for each category of smokers, as 
shown in Table 3. 


TABLE 3 


AND SuRVIVAL RaTEs 
AMONG SEVERAL CATEGORIES OF SMOKERS 


Nonsmokers | Light Smokers Heavy Smokers 
Total annual mortality 
observed by Doll and .01325 .01492 .01884 
Hill [1956] 
Expected mortality rate pi pi + Qip2 Pi + Qip2 + Qdeps 
= 1 — 
Expected survival rate 419293 


The maximum likelihood estimates of the q; are: 
Gd. = 1 — .013825 = .98675, G2 = (1 — .01492)/.98675 = .99831, 
Gs = (1 — .01884)/(1 — .01492) = .99602. 


The variance-covariance matrix was derived as previously described. 
With the appropriate changes in notation, equations (10), (11), and (12) 
apply to the variances and covariance of 9, and g@,. The other values are: 


1492 Nz No 


= 


The @; might be interpreted as follows. Among nonsmokers the 
annual survival rate was 0.98675. For light smokers this rate was 
reduced by a factor of 0.99831. If one assumed, for illustrative purposes, 
that the effect of smoking is constant, one could estimate that after 
twenty years the survivors among light smokers would be (0.99831)”° 
or 96.7 per cent of the survivors among smokers, a reduction of 100 — 
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96.7 = 3.3 per cent. Heavy smoking further reduced the annual 
survival rate by a factor @; = 0.99602. The twenty year survivors 
would be (0.99602)*° or 92.3 per cent as compared with light smokers 
and (.923)(.967) = 89.2 per cent as compared with nonsmokers. 

(iii) An even more complicated model would apply to a comparison 
between nonsmokers, smokers, and persons who had once smoked but 
who had since then given up the habit. If the first two samples are 
defined as in Table 1, stopping might be expected to diminish the risk 
of death for the gop, former smokers by a factor, say p,. The resultant 
expectations are shown in Table 4. 


TABLE 4 


EXxpEcTED MORTALITY AND SURVIVAL RaTEsS AMONG 
NONSMOKERS, SMOKERS AND FoRMER SMOKERS 


Nonsmokers Smokers Former Smokers 
Mortality Rate Po Po + Po + 
= 1— 
Survival Rate go + 
= gol — pepe) 


In this case, estimates of p, and g, could no longer be made by simple 
division. After g and p, were estimated as previously shown, q, could 
be estimated as G, = (x,/n, — 2/nx)/Gop, , Where x,/n, is the observed 
mortality rate among former smokers. 

The additional variance and covariances are: 


nN, No 


SUMMARY AND CONCLUSIONS 


Quantitative comparisons of relative frequencies such as the mor- 
tality rates observed in several samples, are often required either as 
purely descriptive statistics or as estimates of population parameters. 
As a rule such rates are compared by calculating the ratio of two mor- 
tality rates, the ratio of two survival rates, or the difference between 
the two mortality or survival rates. 

These statistics have been analyzed in the light of several models. 
Model 1 applies to the study of different populations with independent 
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probabilities p; of the defined event. The numerical value of a ratio 
depends completely on which cells are selected for the comparison. 
The difference is unaffected by the choice of the investigator and has 
several other advantages. 

An alternative model (Model 2) with numerical examples, was 
suggested for the common situation where the factor being investigated 
is assumed to modify the reaction of the population in one direction 
only. Under this model the expected rates in the several samples are 
not independent. As a result, the difference is a product of two prob- 
abilities; the ratio between one set of rates is compounded of several 
probabilities; while the ratio between the complementary set of rates 
is the estimate of a single probability. When a deleterious effect is 
expected, the ratio of success rates is an estimate of the diminished 
chance of “success” or survival; and when a beneficial effect is expected 
the diminished risk of failure is estimated by the ratio of “failure”’ 
or mortality rates. 

Several different modifications of the model were indicated for the 
case with more than two samples. Maximum likelihood estimates of 
the parameters and their variances were given. 
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THE ANALYSIS OF EXPERIMENTS ON GROWTH RATE* 


F. B. Leecu anp M. J. R. HEAty 


Rothamsted Experimental Station 
Harpenden, Hertfordshire, England 


1. Introduction 


Consider an experiment in which several treatments are applied to 
comparable groups of growing subjects over a period of time. The 
attribute of interest to the experimenter—e.g., liveweight—is measured 
initially and at regular intervals during the experimental period. The 
results of such an experiment are most commonly considered in terms of 
average growth rate—e.g., gain in liveweight per week—and this quan- 
tity is most simply obtained by dividing the difference of the initial 
and final measurements by the duration of the experiment. Experiments 
of this type were discussed by Wishart [1938, 1939]. He pointed out that 
this method of assessing growth rate ignored any information provided by 
the intermediate measurements, and that other features of the growth 
curve besides the average slope might merit consideration. Difficulties 
arise, however, because successive measurements on the same subject 
cannot be considered independent. Wishart accordingly suggested that 
the growth curve for each subject should be broken down into its mean 
and linear, quadratic, etc., components, each of these being subjected 
to separate analysis. The effects of treatments on the average growth 
rate could be seen from the analysis of the linear components, and 
analyses of the further components would show to what extent the 
treatments were affecting the shapes of the growth curves. 

In an experiment of short duration, or when the treatment effects 
are small, a summary solely in terms of average growth rate will often 
be adequate. When a single treatment is compared with a control, this 
is equivalent to assuming that the treatment effect—the average 
difference between treated and control subjects—increases linearly with 
time. It is natural to estimate this effect from the difference between 
the linear components of the growth curves. However, in many types of 
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experiment, the subjects in the treated and the control groups will be 
comparable at the start of the experiment and will be allotted to the 
appropriate groups by some process of randomization. In this case, the 
initial difference between the two groups will be zero, apart from 
sampling errors. Any difference in growth rate will thus lead to a 
difference between the mean weights of the two groups, and consequently 
any such difference in mean weight that is observed can be used to 
provide an estimate of the relative growth rate. We thus have two 
estimates of the relative growth rate, one derived from the means and 
the other from the linear components, and an improved estimate can be 
found by combining the two, with weights depending on their variances 
and covariance. 


2. The mathematical model and tts analysis 


Suppose that n equally spaced observations are taken on each 
subject, and denote an observation in the control group at time ¢ by 
y(t), t = 0,1, — 1). Fitting orthogonal polynomials up to the 
(n — 1)th degree to the set of observations for this subject, we can write 


y(t) = Ddofolt) + bifi() + + (1) 


where f,(¢) is the value of the polynomial of degree p at time t. We 
have: 


THOM) =0, pea say. 


For a population of subjects receiving no treatment, the b’s can be 


considered to be random variables with expectations 8 , 8: , +--+ , Ba-1- 
We can fit polynomials in the same way to the observations on a sub- 
ject in the treated group and obtain coefficients cy , ¢; , +++ , C,-1 With 


expectations , 1, °** »Ya-1- Now, the assumption that the treatment 
effect is proportional to time implies that the difference between the 
expected growth curves is simply a straight line through the origin 
with slope A, say, where \ is the parameter we wish to estimate; and 
this in turn implies that 


(Yo — Bo) fo(t) Bi) =X 
=8, for p> 1. 


(3) 


Now suppose for simplicity that the experiment is carried out in ran- 
domized blocks of two subjects each. Then we can work out the 
differences between the treated and the control subject of any pair at 
each time and fit polynomials to these differences. The resulting 
coefficients dy , d, , --- , dx, are again random variables, and we have 
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E (do) = Yo — Bo; 
E (d,) = 0, p>l. 


Now, multiplying the first equation of (3) by f,(#) and using >> to 
denote a summation over values of t from 0 to (n — 1) here and through- 
out the remaining sections of this paper, we have: 


— Bo) = A tfo(t) 


(4) 


and 


so that we have two estimates of A, 


lo = ko >. tfo() 
L, ky d./>> tf,(d). 


For computing purposes, kod, and k,d, can be calculated as : i fo(t) - 6(t) 
and >> f,(t)-6(), where 4(t) is the observed difference at time ¢ in the 
pair under consideration. 

The observed differences, 6(¢), contain all the information provided 
by the pair of subjects, and they can be replaced without loss of informa- 


(5) 


tion by the polynomial coefficients d, , d, , --- , ds-1 . However, (4) 
shows that only d, and d, provide direct information on \, so that 
we may consider an estimate obtained by combining J, and 1, from (5). 
If voo , Vy; , and vo, are the variances and covariance of kod, and k,d, 
then the variances and covariance of I, and 1, are 


Voo = tfo(t)]’, 
Vi = (6) 
Va = tf,(t)]. 


It is well known that the minimum variance combination of J and J, is 
given by 


i = al, a (1 aa a)l, 
where 


a= (Vas Vor)/(Voo0 2Va1 + Vis) 


and 


VooViu — Ves 


Voo 2V 01 + 


100 
(7) 
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‘These formulae relate to a single pair of subjects. If N pairs are used, 
Voo » Vor , and V,, must all be multiplied by 1/N. The efficiency of 
the combined estimate relative to the simple estimate 1, is: 


Vis(Vo0 2V + Vi) 
V oo V Voi VooVis = Va ( ) 


To evaluate a, we need estimates of the variances and covariance. 
The simplest method is to work out >> fo(é)y(#) and >> f,(®)y(é) for each 
animal separately and to subject these values to straightforward 
analysis of variance and covariance. Uo9 , Vo, , and v,, as defined above 
will be twice the residual variances and covariance in this analysis. 

The extension to more than two treatments is immediate. We can 
evaluate for each treatment quantities given by: 


Differences between the treatment means of y)’s and of y,’s provide 
two estimates of \ for any given pair of treatments, and these are 
combined as before. vo , ¥:: , and vo; are again obtained from analyses 
of variance and covariance of >> f,(t)y(t) and >. f,(é)y(Z). 


3. Treatment effects not linear in time 


The assumption made above of a linear treatment effect is equivalent 
to assuming that treatment has no effect on the quadratic, cubic, 
-++ components of the growth curves. This assumption can conveniently 
be tested by analyses of variance applied to those components. With 
an experiment lasting for some time, the assumption will often prove 
false and a more elaborate model will then be required. Suppose, for 
example, that the treatment effect can reasonably be considered to be 
quadratic in time, so that an extra quadratic term must be inserted in 
equation (3). In the interests of further generalisation it is convenient 
to write the treatment effect in the form \¢,(t) + ud.(é), where ¢, , $2 
are polynomials in ¢ of degrees 1 and 2, passing through the origin, 
and with the property that 


DX = DL = 0. (10) 


These polynomials are most easily obtained by adding a suitable con- 
stant to the values of f, and f,. For convenience, we put 


Referring to equation (4), the expected values of the d’s are now 0 
for p > 2. The modified equation (3) now leads to: 
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— Bo) = Prod + Poot, 

— B:) = pud, (11) 

— B2) = Pook. 

Eliminating » from the first and third of these equations, we get 
— Bo) — Proks(v2 — = ProP22d 


so that we again obtain two estimates of A, 
= (i, dy — Poe k, 
P20 


L=k d,/pir 


The variances and covariance of these estimates are: 


(12) 


1 P20 
Va = E ~ v | 
01 01 Doo 12 


and formulae (7), (8), and (9) can be applied as before to obtain 1. 
Correspondingly, we have as estimates of pu 


mM = (i, dy — k, ds (14) 
11 


m, = d2/Poo 


with variances and covariance 


1 2 
Bio 
00 Pro Voo D 
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and again formulae (7), (8), and (9) can be used to obtain a combined 
estimate *. 1 and m are in fact just the estimates which would be 
obtained by applying orthodox least-squares techniques to equations 
(11). The amount of the treatment effect at different times can be 
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Pir 
Pu 
Vu = V22 (15) 
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estimated from 1, *, and the values of the polynomials. Cubic, quartic, 
-++ terms in the treatment effect can be introduced by an obvious 
extension of the above method. 


4. Values of the p;; 


The quantities p;; depend only on n, the number of observations 
on each subject, and on the particular numerical values chosen for the 
f and ¢ polynomials. We assume that the f values are taken from 
Fisher and Yates ({1957] Table XXIII) and that the ¢’s are obtained by 
adding a constant to the corresponding f’s so as to make the value of 
¢ zero att = 0. In this case, we have p;; = k, ; the k’s are included in 
Fisher and Yates’ tables. f, is given the values 1, 1, 1, --- , so that 
ky = n, while pj is n times minus the value of f; at ¢ = 0. 

As an example we take the case n = 5, and obtain: 


ko = 5 
k, = 10 
ke = 14 
ks = 10 


iw) 
| 
bo 


Pu = 10; pio = 10 
-4 -3 = 14; po = —10 
ps = 10; po = 5 


5. Covariance adjustments 


Reverting to the simple model of section 2, it has been pointed out 
that components higher than that of first degree do not contain terms 
in A, and so do not lead to estimates of this quantity. In fact their 
expectations under the assumed model are all zero. Since these higher 
components will in general be correlated with the mean and linear 
components, it may be possible to use them indirectly to increase 
precision by means of straightforward analysis of covariance (compare 
Cochran and Bliss [1948]). It seems rather unlikely in general that 
such a procedure will be worthwhile. Of greater practical importance 
is the possibility of carrying out a covariance analysis on the initial 
value, 6(0), whose expected value is also zero—this is equivalent to 
using a complicated function of all the components as a covariate. The 
correlations between 6(0) and the mean and linear components are often 
quite large and worthwhile increases in precision may result. When 
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the analysis of covariance tables and regression coefficients have been 
computed, appropriate adjustments can be made to the relevant means, 
variances, and covariances; the formulae of the paper can then be applied 
unchanged. 


6. Example 


Table 1 gives the quantities >> fo()y(#) and >> f,(éy(#) for an 
experiment on pigs in six randomized blocks. Each animal was weighed 
16 times at weekly intervals. The treatments comprised the combina- 
tions of a high and low plane of nutrition (H and L) with a control and 
two antibiotic supplements (O, P, , and P,). The residual mean squares 
and products are shown in Table 2. With 16 observations, we have 
>> tfo(t) = 120, >> tf,(t) = 680. Dividing the treatment means by these 
quantities, we obtain the values of y) and y,, in Table 3. We also 
have from (6), 


TABLE 1 


ToraL WEIGHTS AND LINEAR COMPONENTS FROM 16 WEEKLY OBSERVATIONS 
ORIGINAL OBSERVATIONS IN LB. 


Totals = >> fo(t)y(t) 


Block LO LP, LP2 HO HP, HP, 
1 1565 1625 1629 1809 1884 1609 
2 1731 1902 1760 1869 2030 2058 
3 1599 1267 1663 1910 1868 1693 
4 1280 1322 1466 1390 1552 1645 
5 1426 1667 1560 1270 1693 1673 
6 1407 1505 1372 1495 1464 1261 


Treatment 
Means 1501.3 1548.0 1575.0 | 1623.8 1748.5 1656.5 


Linear Components = 


5249 5509 5295 6343 6844 5653 
5207 5722 5506 6263 6764 6808 
5981 5077 5719 7004 7334 6367 
5098 4970 6120 6469 
5008 5341 5630 3942 6181 6073 
5511 5221 5358 5845 5888 4999 


oor 
i=) 


Treatment 
Means 5335.7 5343.3 5434.3 | 5727.8 6521.8 6061.5 
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TABLE 2 
REstpuaL MEAN SQUARES AND Propucr 


| DX folt)y(t) falt)y(t) 
Dd folt)y(t) 19585. 05 65792 .12 
Dd filt)y(t) 282599.86 


= 4534, .2037, Ves = + .2688 


so that a = —0.545. Calculating —0.545y, + 1.545y, , we obtain the 
values of 9 in Table 3. Differences of these quantities provide esti- 
mates of the growth rate for one treatment relative to another. The 
variance of such a difference is found from (8) to be .1682 and the 
efficiency of the estimates relative to those based solely on the linear 
components is thus 121 per cent. 


7. A note on least-squares estimation 


At the suggestion of a referee, we justify the derivation of 1 and 
m, based on equations (12) to (15), as the least-squares estimates from 
equations (11). These latter can be written in matrix form as 


y= Pr 


where y is a vector calculated from the observations, P a known matrix 
depending on the design of the experiment, and 2 the vector of the 
parameters to be estimated. In least-squares terminology, these are 
the observational equations; the required estimates are the solutions of 
the normal equations 


P’V'y = P’'V 
where V is the variance-covariance matrix of y. 


In replacing (11) by (12) and (14), we are in effect pre-multiplying 


TABLE 3 
TREATMENT MEANS, DIFFERENCES EsTIMATE TREATMENT EFFECTS IN LB./WEEK 


LP, HO HP, HP; 


: 13.12 13.53 14.57 
Yi 7.85 7°86 7.99 8.42 9.59 
5.19 5.64 6.88 


' 
| 
Lo LP 
13.80 a 4 
8.91 
6.24 + 0.29 
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P by a matrix A. (12) and (14) are in fact new observational equations 
which can be written: 


z = Ay = AP. 


The variance-covariance matrix of z is AVA’, so that the normal equa- 
tions become: 


P’A’(AVA’)"'Ay = 
But it is easy to show that A’(AVA’)"'A = V™ provided (A’A) is 
non-singular [pre-multiply successively by AV, A’, (A’A)~*]; so that these 


are identical with the normal equations obtained directly from the 
original observational equations. 


Summary 


A method is presented for the analysis of experiments in which 
successive measurements of the same quantity on the same organism are 
made at equal intervals of time. The specification and estimation of 
treatment effects are discussed and the method is applied to an experi- 
ment on the growth rate of pigs. 
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1. Introduction 


In his investigation into the nature of the hypothesis of no three- 
factor interaction in a 2 X 2 X 2 contingency table, Bartlett [1] found 
that in order to estimate the parameters of the underlying distribution, 
it was first necessary to solve a cubic equation in one unknown. For a 
more complex three-way table, namely the 2 X 2 X 1, the estimation 
problem was shown to involve the solution of (¢ — 1) simultaneous 
third-degree equations in as many unknowns. Norton [2] demonstrated 
a rather neat iterative procedure for solving these systems of equations. 
More recently, Roy and Kastenbaum [3] extended the no three-factor 
interaction hypothesis to the general three-way contingency table, 
and in so doing, found that the estimation of the pararteters in this 
case involved the solution of (r — 1)(s — 1)(¢ — 1) simultaneous 
third-degree equations in as many unknowns. 

It is the purpose of this paper to demonstrate a technique which, 
while practical with an ordinary desk calculator, is particularly well 
suited for modern high-speed computers. The technique involves the 
superimposition of Newton’s method of functional iteration onto the 
procedure suggested by Norton [2]. 


2. Preliminaries 


The no three-factor interaction hypothesis for an r X s X ¢ table 
is given [3] by: 
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where = 1,2,--- , = 1,2,---, (8 —1;k =1,2,---, 
(¢ — 1), and where the p,;,’s are the parameters of the multinomial 
distribution: 


N! r s t 


t=1 j=1 k=1 


r s t r 8 t 


D Nie = N, and = i. 
t=1 j=1 kel t=1 j=1 k=1 
Under the null hypothesis, estimates of the parameters may be 
achieved by first solving the following simultaneous systems of third- 
degree equations for all 2;;, : 


(2.3) 


s—1 


j=1 


(msn + > + rin) 


t=1 


where = 1,2, ---, (7 = 1,2, ---, (8 —1);k = 1,2, --- , 2, 
and where 


t-1 t 
» OF = 0. (2.4) 


k=1 k=1 
3. Iterative Solution 


To solve equations (2.3), first let all z,;, = 0, except for one set: 


Tir (Say). 
Then the general term of (2.3) reduces to 


Ge — — Lire) 
(Misk + Liz) (Mere + ix) @.1) 


fork = 1, 2, --- , t. It follows, by direct application of Norton’s [2] 
procedure that 


a) a) 
Call — hiv (3.2) 


where 


for 
r-1 e-1 
t=1 j=1 
4 
a 
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(i) the superscript (a) refers to the ath correction, 


m 


a=1 
(iii) m is the number of iterations necessary to make 


x", = 0 with desired accuracy, 


b Ny okNrik 
? 
Ny 


1 1 1 1 ) 
+ Ni sk + Nrik 


t t 
k= k=1 


Equation (3.2) provides first approximations for the z,,,._ These values 
of xi} are now either added to or subtracted from the observed cell 
frequencies with which they are associated according as equation (2.3) 
specifies. After this set of corrections has been applied, the iteration 
continues with the next set of equations involving 2,., (say). Again, 
set all 2;;, = 0 except for the 2,2, , solve for x{3} , apply these corrections 
to the appropriate cell frequencies, and continue this procedure for all 
vi, [¢ = 1,2, ---,(r — 1);7 = 1, 2, --- , (8 — 1)]. When all the first 


corrections have been determined and applied, the iteration begins 
2) 


again with x{7, , and continues until all the x", = 0 with desired accuracy. 
In applying this proposed extension it is wise to take cognizance of 
the following five points noted by Norton ([2], pp. 254-255): 


(i) The values of x‘*} as obtained in (3.2) must satisfy (2.4) for 
all 7, j, and a. 

(ii) c,;;, need not be recalculated at each iteration. 

(iii) The number of figures necessary in the arithmetic may be found 
by a consideration of (3.1). 

(iv) A partial check on c;;, derives from the fact that ¢,;;, cannot 
exceed the smallest of the observed numbers from which it is 
calculated. 

(v) The x,;;, must be numerically smaller than their respective n;;, . 


All of these points apply to the procedure outlined in the present paper. 
However, if a high-speed digital computer is to be used for any specific 
problem, and if its capacity is such that it can carry more than the 
minimum number of decimal places required by the arithmetic, then 
specific instructions concerning Norton’s first four points need not be 
given to the computer. 
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The fifth point, however, is a necessary condition rather than a 
suggestion. In proceeding from equation (3.1) to approximation (3.2), 
the quantities (1 + 2;;,/n;;,)~* are approximated by the first two 
terms of the series 


(ise) _ Zu) 

Nik + (24) + 

This approximation holds only if (z;;./n;;,)?> < 1. To satisfy the 
fifth condition it may therefore be necessary sometimes to estimate 
first corrections before applying equation (3.2). These corrections 
will be necessary only when the expected value of a particular cell in the 
contingency table exceeds the observed value by at least a factor of two. 


This will always be true for a vacant cell, and occasionally for cells 
which are sparsely filled. 


4. Calculation of the Test Statistic 


Let 
e-l r-1 s-1 
j=1 k=l 
s-l t-1 
j=1 k=1 j=1 
r-1 t-1 r-1 
Brit = Viik » = Viik » 
i=1 k=l i=1 
t-1 
= Viik » Mijn = - 
k=1 
Then the statistic 
r t 
2 2 
X= H+ (4.2) 


t=1 j=1 kel 


is distributed approximately as chi square with (r — 1)(s — 1)(¢ — 1) 
degrees of freedom, where 

Yisx = +1, if 77k = rst or if any two subscripts differ from rst; 
and 

Yiie = —1, if only one or all three subscripts differ from rst. 
Equation (4.2) provides a test statistic for testing H, in (2.1). 


5. Illustrative Example 


The data in Table 1 represent a portion of an experiment performed 
at the Oak Ridge National Laboratory. For this experiment litters of 


j 
| 
| 
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TABLE 1 


NuMBER oF LITTERS OBSERVED CLASSIFIED BY LITTER S1zESs, TREATMENT, 
AND NUMBER OF DEPLETIONS 


Number of Depletions 


Litter Treatment 


2 or More 


11 5 


(121) (131) 


19 


(221) (231) 


14 


(122) (132) 


17 


(222) (232) 


(123) (133) 


(223) (233) 


(124) (134) 


(224) (234) 


(125) (135) 


(225) (235) 


mice of various sizes were treated in either one of two different ways, 
and the number of deaths per litter before weaning was observed. 

For each of the two treatments a chi-square statistic with eight 
degrees of freedom was computed to test the hypothesis that the number 
of depletions is independent of litter size. This hypothesis was rejected 
for both treatments. In addition, for each of the five litter sizes, chi- 
square with 2 degrees of freedom was computed to test the hypothesis 
that the number of depletions is independent of the treatments. In 


111 
(111) 
B | 75 = 7 
(211) 
A 49 = 10 ae 
B 58 = 8 
A 33 18 15 en 
(113) 
B 45 22 10 eo 
(213) 
A 15 13 15 ae 
10 
B 39 22 18 pr 
(214) 
A 4 12 17 | 
(115) 
B 5 15 8 
(215) 
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each case, there was no reason to reject this hypothesis. From these 
two sets of tests, one may conclude first that the number of depletions 
is dependent upon litter sizes for both treatment A and treatment B, 
and secondly that the treatments do not significantly effect the number 
of depletions in any of the litter sizes. 

The “no-interaction” chi-square confirms these results. This 
chi-square, with eight degrees of freedom, is found to be 3.158, and 
corresponds to a probability of greater than 90 per cent. From this 
result we may conclude that the 2 X 3 X 5 table is homogeneous. 
That is to say, the significant interaction between the number of deple- 
tions and the size of litter is consistent for both treatments, and the 
lack of interaction between the number of depletions and treatment is 
consistent for all litter sizes. 

The “‘no-interaction” hypothesis is tested by the techniques outlined 
above. Table 1 isa 2 X 3 X 5 table; (i.e., r = 2;s = 3;¢ = 5). For 
these data the constraints implied by (2.1) produce eight simultaneous 
third-degree equations in eight unknowns, which may be written as: 


(8 — — — 2115) (18 — — — 
(17 + tis + i25)(5 + 2115) (15 + + + 
(10 — 2113 — X123)(383 — 23) 
(15 + + L123) (45 + 2113) 

(8 — — Li22)(49 — 

(10 + fu + L102)(58 + 2112) 

(5 + + L121)(75 + 211) 


(5.1) 


and 


(8 Vio5)(12 — X25) (18 — — Li04)(13 = Ly 04) 
(17 + + Lies)(15 + 2125) (15 + + 


= (10 — 213 — — 
(15 + tus + L103)(22 + 2123) 


(8 — Lii2 — X190)(14 = 
(10 + Sus + X122)(17 + 2122) 


= (7 — — %2:)(11 — 
(56 + tun + + 2121) 
Associated with these equations are the constraints, (see 2.4): 


+ + + = —Tus ; 


and Lior + + + Lies = 


|| 
| 
(5.2) 
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The solution of equations (5.1), by the generalized Norton approach 
follows the procedure outlined in Table 2. Table 2 is the first iteration 
of approximately twenty which are necessary to ensure six decimal 
place convergence of x3} to zero for all j and k. 

This problem was coded as a pilot study for the ORACLE (Oak 
Ridge Automatic Computer and Logical Engine) at the Oak Ridge 
National Laboratory. Aside from the time spent in coding, the total 
time spent by the computer in solving equations (5.1) and in computing 
the final chi-square statistic was infinitesimal. The expected number 
of observations per cell on the null hypothesis are given in Table 3, 
and the results of the calculation are given in equations (5.3), (5.4), 
and (5.5): 


Tit = Miu = Ma = +3.004982; 
Lii2 = Mie = Mae = +0.620963; 
Lis = Mus = Mas = —1.127163; 
Tis = Mis = = —2.132082; 
Lis = —Mus = = —0.366700; 


Zio = tes = Han = —1.381348; (5.3) 
= = = +0.008489; 
Xi23 = = = +0.530742; 
= Hiss = = +1.920393; 
= = = —1.078276; 
+ 2101) = = = +1.623634; 
(2112 + = tise = Hose = +0.629452; 
(113 + = = Moss = —0.596421; 
(Gira + Trios) = = Hose = —0.211689; 
+ = = = —1.444976; 
bie = 1.74746 bi, = 1.75312 for k = 1,2,3,4,5: (5.4) 
= 3.158 (5.5) 


with 8 degrees of freedom. 

At present a more general ORACLE program is available for prob- 
lems of this type. This program will handle contingency tables of sizes 
r= 2,3,4,5;8s = 2,3, 4,5; and¢ = 2, 3, --- , 15, 16. The general 
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TABLE 3 
ExpecTtED NuMBER OF LITTERS ON THE NuLL Hyporuesis or “No-INTERACTION” 


Number of Depletions 
Treatment 


0 1 2 or More 


54.995018 | 12.381348 6.623634 


78.004982 | 17.618652 5.376366 


48.379037 | 13.991511 | 10.629452 


58.620963 | 17.008489 7.730548 


34.127163 | 17.469258 | 14.403579 


43 .872837 | 22.530742 | 10.596421 


17.132082 | 11.079607 | 14.788311 


36.867918 | 23.920393 | 18.211689 


4.366700 | 13.078276 | 15.555024 


4.633300 | 13.921724 9.444976 


program gives the complete solution to the problem cited above in less 
than sixty seconds. In another problem involving a 5 X 5 X 6 table 
with observed frequencies ranging from one to six digits, the computing 
time was about two hours. Though the time factor may be reduced 
somewhat by a reduction in the number of decimal places required by 
the arithmetic, two hours are perhaps not too much to ask for the 
solution of eighty simultaneous third-degree equations in as many 
unknowns. 
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Introduction 


Designs involving cyclical arrangements of treatments are frequently 
used in experiments extending over several periods in order to reduce 
errors due to variation between experimental units. These designs are 
known as change-over designs. 

In the simplest type of change-over design the sequences of treat- 
ments are determined by the columns of one or more Latin squares. 
Any Latin square can be used when the effects of treatments do not. 
persist beyond the period of application. Frequently, however, it is 
desirable to take residual effects into account. Change-over designs 
which provide for the estimation of first residual effects, i.e., residual 
effects in the period immediately after application, have been devised 
by Cochran, Autrey, and Cannon [1941] and Williams [1949] using 
special Latin squares. The residual effects are estimated by introducing 
additional constants into the ordinary analysis of Latin squares. For 
details of these designs and their analysis reference can be made to 
the review of the subject by Cochran and Cox [1957, p. 133]. 

Designs which permit the number of periods to be less than the 
number of treatments have been investigated by Patterson [1951, 1952]. 
These designs consist of series of incomplete Latin squares and are 
analysed accordingly with provision for first residual effects. 

Yates [1951], Lucas [1957], and Cochran and Cox [1957] have 
pointed out that in the analysis of change-over designs 

(a) residual effects are less accurately determined than direct effects 
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(b) the estimated variances of the sums of direct and residual effects 
for individual treatments (which can be referred to as permanent 
effects) are greater than the sums of the variances of the separate 
effects. 

Cochran and Cox [1957] have also pointed out that 
(c) tests of direct and residual effects are not independent. 

Points (a) and (b) are of little importance when residual effects are 

used only in order to adjust the direct effects for the effects of previous 

treatments. Frequently however, residual effects are as interesting 
as direct effects and in some types of work the estimates of permanent 
effects must be regarded as measures of the true practical values of the 
treatments. The inaccuracies of estimates of residual and permanent 
effects are most serious for change-over designs with small numbers of 
periods. Thus, for three and four period Latin square designs the 
estimated variances of the different effects, relative to the variances of 
treatment means are as follows: 


Comparisons between 3-period 


Treatment means 
Adjusted direct effects (t;) 
Residual effects (r;) 

Permanent effects (t; + 


bo 
Ore 
on 


Lucas [1957] (also reported by Cochran and Cox [1957]) has con- 
sidered, as a means of meeting these criticisms, designs obtained by 
adding an extra period to the basic Latin square patterns. The treat- 
ments of the last period of the basic design are repeated in the extra 
period. An attractive feature of these extra-period designs is that, in 
the analysis, direct and residual effects are orthogonal. As a result the 
estimated variances of the residual and permanent effects are reduced. 

The purpose of the present paper is to consider a wider class of 
extra-period designs obtained from any basic change-over design 
satisfying certain conditions of balance. 


Conditions for balance in the basic change-over designs 


A characteristic of all the change-over designs mentioned above, 
with the exception of the extra-period designs considered by Lucas 
[1957], is that no treatment is applied to any experimental unit in more 
than one period. Any designs of this type which satisfy certain con- 
ditions of balance described below will be referred to as basic change-over 
designs, 
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If there are ¢ treatments and p periods and the experimental units 
are arranged in b blocks of k units, where k > 2, the conditions, which 
must be satisfied for all pairs of treatments 7 and j,7 ¥ j,7,j = 1,2, --- ¢, 
are as follows: 

1. Treatment 7 occurs in each period in bk/t blocks. 

2. Treatment j also occurs in each period in bk(k — 1)/t(é — 1) of these 
blocks. 

3. \ = bkp(p — 1)/t(t — 1) of the bkp/t units which receive treatment 7 
in some period receive treatment j in another period. 

4. \/p units receive treatment 7 then treatment j in successive periods. 
5. \/p units receive treatment 7 in the pth period and treatment 7 in 
another period. 

These conditions ensure that when constants é; and f; are fitted 
for direct and first residual effects, respectively, the variances of all 
differences between the /; are equal and the variances of all differences 
between the f; are equal. Thus the analysis is greatly facilitated. 
Balance with respect to residual effects (condition 4) is, however, 
incomplete since treatment 7 is never preceded by itself. As a result 
direct effects are not orthogonal with residual effects. 

Lucas [1957] has shown that in the case t = k = p complete balance 
with respect to residual effects can be obtained without upsetting the 
balance with respect to experimental units simply by repeating the 
treatments of the pth period in a (p + 1)th period. Each treatment 
then precedes itself and each other treatment equally often. In the 
following sections we will consider the properties of any extra-period 
design derived in this manner from any one of the general class of basic 
change-over designs. 

The conditions for basic change-over designs described above are 
more general than the conditions described by Patterson [1952]* in 
that ¢ is not restricted to be equal to k. If p = k the treatments of 
each block follow a Latin square pattern; conditions 3 and 5 are then 
redundant. It should be noted that when k = 2 direct and residual 
effects cannot be estimated separately in the basic designs. Arrange- 
ments for k = 2 which satisfy the balance conditions do, however, exist 
and will be used in constructing a special class of extra-period designs. 


Extra-period designs 


Suppose now that the treatments of the pth period of any design 
satisfying the above conditions are repeated in a (p + 1)th period. 


*Patterson uses b to denote the total number of experimental units, » to denote the number of 
treatments and k to denote the number of periods. The notation in the present paper appears to be 
more suitable. 
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Constants are to be fitted for direct and residual effects. These will 
be obtained from totals 


T= Du, R= 


units units 
where 
y; is the yield of the unit in the period or periods in which it receives 
treatment 7, 
z,; is the yield in the immediately following period, 
after adjusting for differences between experimental units and periods 
within each block. The adjusted totals (multiplied by p’) are 


Ti = p'T; — (Ua + Vin) (1) 
Ri = p'R, — Us BY 2) 


where 

p’ = p+ 1is the number of periods, 

U,, is the total yield of all units receiving treatment 7 in any period 

just once, 

U,2 is the total yield of all units receiving treatment 7 in the extra 

period, 

P,, is the total yield in period 1 of all blocks receiving treatment 7, 
and 

B; is the total yield of the same blocks over all periods. 

T; and R/ can be expressed as functions of the constants for direct 
and residual effects. In particular we will consider the contributions 
of residual effects to JT; . 7’; is a total of kbp’/t values, kb/t of which 
are preceded by treatment 7 and \/p by each other treatment j (con- 
dition 4). Hence the contribution of residual effects to 7; is (kb/t)r; + 
(A/p) doin: 7; - Similarly U;, is the sum of kbp/t units in each of 
which treatment 7 occurs once in one of the first p periods and in \ of 
which treatment j occurs in one of the first p periods (condition 3). 
Uj is the sum of kb/t units in each of which treatment 7 occurs in the 
pth period and in \/p of which treatment j occurs in one of the first 


p — 1 periods (condition 5). Hence the total contribution of residual 
effects to is 


Thus direct and residual effects are orthogonal. This extends the result 
obtained by Lucas [1957] for the case t = p = k. 


The most useful designs for ¢ = p = k have been listed by Lucas 
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[1957]. Designs for unequal /, p, and k are given by Patterson and 
Lucas (in preparation). Here we will consider only a few examples 
of the different types of design which are available. 

An important series of designs is derived from complete sets of 
orthogonal Latin squares having the property that any square of the 
set can be obtained by permuting the rows of any other square of the 
set, All the sets of orthogonal squares given by Fisher and Yates 
[1953] are of this type. An example fort = k = p’ = 4 is given in 
Fig. 1. This design is obtained by selecting the first 3 rows of each of 
3 orthogonal 4 X 4 squares and repeating the 3rd row. Each square 
defines a block. 


FIG. 1. 
Extra-PERIOop DESIGN For 4 TREATMENTS IN BLOCKS OF 4 


Period Block 1 Block 2 Block 3 


1 23 414.2 3.4 
2 7 it 4313 41 214 3 2.1 
3 3 412)43 21/)2 1 4 8 
4 8214 82 1 


In practice the rows of the Latin squares should be selected at 
random and allocated to periods in random order using the same ran- 
domization for each square. This allocation should be made prior to 
adding the extra period. Treatments are allocated at random to numbers 
and the sequences of each block are allocated at random to the experi- 
mental units. 

This design can also be arranged in blocks of 3 units as shown in 
Fig. 2. 

Designs derived from complete sets of orthogonal squares required 


FIG. 2. 
Extra-PEriop DrEsIGN For 4 TREATMENTS IN BLOCKS OF 3 


Period Block 1 | Block 2 | Block 3 | Block 4 


1 218 4 138 42 444 2 3 
2 248112414 83 113 4 2 
3 3 21/4 1 2]1 4 38/2 3 4 
4 114 4 38 4 


| 
4 
4 
} 


EXTRA-PERIOD CHANGE-OVER DESIGNS 121 


a multiple of f(¢ — 1) units. Designs are also available for fewer than 
‘(¢ — 1) units. An example for 7 treatments requiring only 14 units 
is shown in Fig. 3. The rows of this design must not be randomized. 


FIG. 3. 
Extra-Periop Design For 7 TREATMENTS IN BLOCKS OF 7. 
Period Block 1 Block 2 
1 45 6712 Se 4 7 
2 2B 42S 6 7 ss 
3 315 671.23 4 
4 7 2S. 2-6 4-6 


An alternative design for the same number of treatments and periods 
is given in Fig. 4. This design requires twice as many units as the 
design of Fig. 3; on the other hand the reduction of block size may permit 
more efficient grouping of the units. 

As previously mentioned the basic change-over designs require k 
to be not less than 3 to permit direct and residual effects to be estimated 
separately. There is, however, a series of extra-period designs for any 


FIG, 4. 
Extra-Per1op DEsiGN For 7 TREATMENTS IN BLOCKS OF 4 
Period Block 1 Block 2 Block 3 Block 4 

3 £139 2 2614 2 Gis 2-7 
3 4227 Sie oi 2 
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t, k = 2, and p’ = 3 requiring ¢(¢ — 1) units. These designs are likely 
to be useful when the experimental units fall naturally into groups of 
two, e.g., twins in animal experiments. The two units of each block 
are as follows: 


Period 
1 t j 
2 j t 
3 j t 


There are, altogether, ¢(f — 1)/2 blocks, one for each combination of 7 
and j,t 

In the designs of Figs. 2 and 4 residual effects are partially confounded 
with blocks X periods. The fraction of information confounded is 
(t — k)/k(tp — 1), ie., 4.8% for the design of Fig. 2 and 6.7% for the 
design of Fig. 4. Direct effects are not confounded with blocks xX 
periods. 


The analysis of extra-period change-over designs 


The equations for estimating ¢; and r; (with constraints >> t; = 0, 


> r; = 0) are 


Ap'i, = T!, (3) 
Cp?; = Ri, (4) 
where 
A = n(p’ + 1)(p’ — 2)/p'(t — 1), (5) 
C = — 1)(kp’ — k — — 1), (6) 


n is the total number of units, and 7 and R/ are given by (1) and (2). 
It is worth noting that when p = k equation (2) simplifies to 


(p’ — = (p’ — DR; — By + Pi (7) 


since U;, = B; . Hence, in this case, the estimates of the residual 
effects do not involve the yields of the first period. 

Since the direct and residual effects are orthogonal the analysis of 
variance of extra-period designs is somewhat simpler than that for the 
standard designs. The partition of degrees of freedom and expressions 
for the sums of squares for direct and residual effects are set out in 
Table 1. 

The estimated variances of the direct, residual, and permanent 
effects are 


+! 
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TABLE 1 
ANALYSIS OF VARIANCE OF ExtTrRA-PERIOD CHANGE-OVER DESIGNS 
Variation df. MSS. 
Blocks b-1 
Periods 
Periods < blocks (b — 1)(p’ — 1) 
Units within blocks b(k — 1) 
Direct effects t-—1 1'2/Ap'? 
Residual effects t-—1 R’*/Cp’? 
Error b(p’ — 1k — 1) — 2(t — 1)| (by subtraction) s? 
Total bkp’ — 1 
var (r; — = 28°/C (9) 
var (t; = t; = 2s°(A + C)/AC. (10) 


These estimated variances are, however, biased. It can be shown, 
by considering the asymmetry of contrasts between the d; and contrasts 
between the r; that the average values of the mean squares for direct 
and residual effects over all randomizations are not equal to each other 
or to the average value of the error mean square in the absence of 
treatment effects. 

Similar biases in the estimates of error occur in the analysis of the 
basic change-over designs. These biases have been discussed with 
particular reference to nutrition experiments with dairy cattle by 
Patterson [1950] and Lucas [1951]. Lucas concluded that the biases 
for these designs are not of sufficient magnitude to be of practical 
importance. 

It seems likely, in view of the additional asymmetry introduced by 
the extra period, that the biases in extra-period designs will be some- 
what greater than the biases in the ordinary designs. We will, therefore, 
consider the three period extra-period designs in some detail, again 
with reference to experiments with dairy cattle. 

Dairy cattle are normally introduced into experiments at a little 
after peak lactation. The subsequent declines in milk yield vary from 
animal to animal, and the between-animal variance of linear components 
of the time trend tends to be greater than the variances of the quadratic, 
cubic, --- components. Tor the purpose of this discussion we will 
suppose that the quadratic, cubic, --- components are independent 
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of each other and of the linear components and have equal variances 
per period of o”. The variance per period of the linear components will 
be denoted by oj . 


TABLE 2 


Ratios oF TRUE VARIANCES TO AVERAGE ESTIMATED VARIANCES 
IN ExTRA-PERIOD DEsiGns WITH p’ = 3 


Ratios when is 
Effects t 1.4 2 5 

Direct >2 1.08 $47 1.33 
Residual 2 0.92 0.83 0.67 
3 0.95 0.87 0.73 

4 0.94 0.88 0.76 

large 0.96 0.92 0.83 

Permanent 2 1.13 1.26 1.52 
3 1.27 1.55 

4 .14 1.28 1.56 

large 1.14 1.29 1.58 


Lucas [1951] determined an average value of o7/o” equal to 1.4 for 
a series of experiments with three periods and k = 3. The ratios of 
true variances to expectations of estimated variances with this value 
of o{/co° are set out in Table 2 for extra-period designs with p’ = 3. 
Biases are somewhat greater here than with the basic designs, for which 
the corresponding ratios are approximately 1.07 for direct and permanent 
effects and 1.00 for residual effects. But they are probably still not 
large enough to have serious consequences. Ratios are also given in 
Table 2 for greater values of o7/o’. 

In general, the value of o//c’ is likely to depend to some extent on 
the efficiency of the method of allocating animals to groups (i.e., blocks). 
If animals in the same block are placed on experiment at the same time, 
and are of similar production level, age, physiological state, etc., 7/0” 
is not likely to be large. Thus the most precise experiments can be 
expected to be subject to the smallest biases in estimates of error. 

Similar biases are obtained in designs with other numbers of periods. 
In each case the errors of direct and permanent effects are underestimated 
and the errors of residual effects are overestimated if there is a sub- 
stantial linear component. 

Possibly the most serious bias for practical purposes is that in the 
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estimated errors of residual effects. A better estimate of error can be 
obtained for designs with p = k by means of an analysis of variance 
omitting the first period. The calculation of this estimate is sometimes 
difficult and will not normally be worthwhile unless a critical test of resi- 
dual effects is required. 

As previously pointed out estimates of residual effects are obtained 
entirely from the results of the 2nd, 3rd, --- , p’th periods when p = k. 
Hence the sum of squares for residual effects is the same in the new 
analysis of variance as in the analysis of Table 1. On the other hand, 
the sum of squares for direct effects requires to be recalculated. This is 
relatively simple in the case of designs derived from complete sets of 
orthogonal squares. If 7; , U;, , and U;2 are now defined with reference 
to the yields of the 2nd, 3rd, --- , p’th periods and 


Ti = (p’ — OT; — Ua + (11) 
the estimates of the direct effects in the p periods are given by 
bp’(p’ — 1)(p’ — = (12) 
Hence the required sum of squares for direct effects is 
1) T"/bp'(p’ — 1)°(p' — 8). (13) 
The partition of degrees of freedom is set out in Table 3. 
TABLE 3 
ANALYSIS OF VARIANCE OmITTING First PERIOD 
Expectation 
Variation d.f. MS. of M.S. 
Blocks b-1 
Periods p'—2 
Periods X blocks (b — 1)(p’ — 2) 
Units within blocks b(k — 1) 
Direct effects t—-1=f 
Residual effects t—-1=f 
Error b(p’ — 2k —1) —2(t-—1) =f. Zs 
Total bk(p’ 1) 


If the average values of the mean squares for direct effects, residual 
effects, and error over all possible randomizations in the absence of 
any treatment effects are Z, , Z, , and Z; respectively, 


= (fZ, + + f.)- 
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Consequently 


Z; = + f)Ze {Z,}/f. 


which is nearly Z, if f, is much greater than f. Hence s” provides a 
nearly unbiased estimate of the error of residual effects provided that 
the number of degrees of freedom for error in Table 3 is sufficiently large. 

With designs which are not derived from orthogonal Latin squares 
the determination of the sum of squares for direct effects in Table 3 
usually presents some difficulty. Thus for the design of Fig. 4 the 
equations replacing equation (12) are, in matrix form, 


2X 


L—3 —3 -4-4-3 -3 20) LT? 


where the matrix of coefficients of the /; is a circulant matrix. The é, 
are given by 


| 11593, 22, 1,505,505, 1, 22|/ 7%] 
i, 22, 11598, 22, 1, 505, 505, 1||7% 

1,505,505, 1, 22, 11593) 


where the matrix of coefficients of the 7’; is also circulant. The required 
sum of squares is then >>; 

If k > p in a design derived from a complete set of orthogonal 
squares, residual effects can still be tested by an analysis of the type 
described above, but a fraction (k — p)/p(kp — 1) of information on- 
residual effects will be lost. In such cases the residual effects are given by 


n(p’ — = 
where 
(p’.— DR; — 


R, is determined from the z’s in periods 2 to p’ and U% is the total yield 
over periods 2 to p’ of all units receiving treatment 7 in one of the first 
p periods. The sum of squares for residual effects is 


(¢-— 1) — — 2) 
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and the variance of a difference between two values of 7; is estimated by 
2(¢ — — 2). 
Example of the analysis 


The data set out in Table 4 are fat corrected milk yields per day 
taken from a change-over trial with dairy cows in which the treatments 
had little effect. The cows have been allocated to blocks according to 
performance in a preliminary period. A randomized version of the 
design of Fig. 2 was then superimposed on the data. 

Marginal totals are computed as shown in Table 4. The computa- 
tions required to estimate direct and residual effects are set out in 
Table 5. Since p = k in this case U, = B and (7) is used in place of (2). 
It is convenient to define R” = 3R’/4. Since t = 4, p’ = 4,k = 3, 
b = 4,andn = 12, A = 10 and B = 8 [from (5) and (6)]._ Hence from 
(3) and (4) é = 7’/40 and # = R’/32 = R”’/24. The quantities shown 


TABLE 4 
AVERAGE Datty FCM Propuction Pounps 


2 


Block 1 Block 2 


Period 1 2 3 Total 1 2 3 Total 


38.7(1)* 48.9(4) 35.2(3) | 122.8 | 34.6(1) 32.9(3) 30.4(2)| 97.9 
37.4(4) 46.9(3) 33.5(1) | 117.8 | 32.3(3) 33.1(2) 29.5(1) 94.9 
34.3(3)  42.0(1) 28.4(4) | 104.7 |  28.5(2) 27.5(1) 26.7(3) | 82.7 
31.3(3)  39.6(1) 25.1(4) | 96.0 | 27.1(2) 25.1(1) 23.1(3)| 75.3 


Total} 141.7 177.4 122.2 441.3 | 122.5 118.6 109.7 350.8 


Block 3 Block 4 


Period 
Period 1 2 3 Total 1 2 3 Total | Total 


25.7(3) 30.8(4) 25.4(2) | 81.9 | 21.8(1) 21.4(2) 22.8(4) |° 66.0 | 368.6 
26.1(4) 29.3(2) 26.0(3) | 81.4 | 23.9(2) 22.0(4) 21.0(1) | 66.9 | 361.0 
23.4(2) 26.4(3) 23.9(4) | 73.7 | 21.7(4) 19.4(1) 18.6(2) | 59.7 | 320.8 
18.7(2) 23.2(3) 19.9(4) | 61.8 | 17.6(4) 16.6(1) 16.1(2) | 50.3 | 283.4 


Total | 93.9 109.7 95.2 298.8 | 85.0 79.4 78.5 242.9 | 1333.8 


*Treatment symbols in brackets 
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in column (10) of Table 5 are adjusted treatment means obtained by 
adding the overall mean, 27.79 to each é; the quantities shown in column 
(11) provide estimates of permanent effects. 

The analysis of variance, following Table 1, is set out in Table 6. 
Standard errors derived from s’, the error mean square, are shown in the 
last three columns of Table 5. Thus 0.732/10 = (+0.27)’. 


TABLE 5 
EstTIMATION OF DirEcT AND RESIDUAL EFFECTS 


(1) (2) (3) (4) (5) (6) (7) (8) (9) (10) qd). 
Treat-| 7 R B U2 Pi iad RY i i+r 
ment (+£0.30) | (-40.27) |(- 0.41) 


349.3 | 248.6/1035.0 | 375.4 | 286.7 |—13.2 |—2.5 |—0.33 | —0.10 | 27.46 | 27.36 
295.9 | 215.0) 892.5 | 294.9 | 245.8 | —3.8 |-1.7 |—0.10 | —0.07 | 27.69 | 27.62 

. 361.1 | 302.6 4.0 5.8} 0.10 0.24 | 27.89 | 28.13 
324.6 | 236.9) 983.0 | 302.4 | 270.7 13.0 |—1.6 | 0.32 | —0.07 | 28.11 | 28.04 


mote 
° 
to 
3 
= 
© 


Total |1333.8 | 965.2)/4001.4 |1333.8 |1105.8 0.0 0.0 |—0.01 0.00 | Mean = 27.79 


TABLE 6 
ANALYSIS OF VARIANCE 
Variation df. Sum of squares Mean square 

Blocks 3 177.7142 
Periods 3 388.2958 
Blocks X periods 9 19.3092 
Units within blocks 8 458.0033 
Direct effects 3 > 77/160 = 2.3355 0.778 
Residual effects 3 > R”2/72 = 0.6274 0.209 
Error 18 13.1671 0.732 

Total 47 2659 . 4525 
Discussion 


In the present section we will consider, with reference to the precision 
of estimates of the treatment effects, the problem of deciding whether 
or not an extra-period design should be used. A decision need not be 
taken until p periods have been completed (except in the case of designs 
in blocks of 2 which necessarily require 3 periods). Two situations can 
arise at this point: 


| | 
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(a) The design for p periods cannot be converted into a basic change- 
over design with p + 1 periods. The alternatives in this case 
are either to terminate the experiment or to add an extra period 
repeating the treatments of the pth period. 

(b) Conversion is possible. The alternatives to be considered are 
then to terminate the experiment, to change the treatments of 
the pth period in a (p + 1)th period, or to repeat them. 

The efficiency factors shown in Table 4 provide a limited amount of 


guidance in case (a) or the counterpart in case (b). These factors are 
given by 


Ve (p “+ 1) 


where Vs; and Vy, are the variances of the p period design and the 
extra-period design (taking account of residual effects) when the errors 
in each period are independent and have equal variances. The ratio 
p/(p + 1) is introduced to allow for the cost of the additional period. 
In practice the cost of the additional period will often be less than a 
fraction 1/(p + 1) of the total cost of the experiment. In such cases 
the efficiency factors of Table 7 tend to be too low. On the other hand, 
with some types of experimental material the true variances per period 
of the extra-period design can be expected to be somewhat larger than 
those of the p period design. 

The high values of the efficiency factors for p’ = 3 merely reflect 
the virtual impossibility of separating direct and residual effects in a 
2-period design. However, there are few 2-period designs for k > 2 
in the practical range of t which cannot be converted into a balanced 
3-period design. 

Nevertheless the efficiency factors for p’ > 3 indicate that extra 
periods may be of considerable value, particularly when additional 
information is required on residual and permanent effects. 

It is also possible in some circumstances that extra periods may be 
contemplated when there are no residual effects. The efficiency factor 
for direct effects in such cases is simply p(p + 2)/(p + 1)’. 

Case (b) can be treated rather more completely. The efficiency 
factors shown in Table 8 compare the variances of extra-period designs 
with the variances of the basic change-over designs for the same numbers 
of treatments, periods, units, and blocks. Various values of o;/o” are 
introduced to take account of the biases in the error variances discussed 
in an earlier section. The efficiency factors vary only slightly with k 
and are therefore given only for large k when o;/o” > 1. 

The extra-period designs are, therefore, preferable to the basic 
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TABLE 7 


PERCENTAGE EFFICIENCIES OF ExTRA-PERIOD DESIGNS RELATIVE 
To Basic Desians WitTH ONE Less PEriop (REsmpuAL EFFEcTs PRESENT) 


With k equal to 
p=pt+l ‘2 3 4 large 
Direct effects 
3 © 356 267 178 
4 -- 118 116 113 
5 _ _ 106 105 
6 — 102 
Residual effects 
3 © 889 662 356 
4 - 169 164 152 
5 128 124 
Permanent effects 
3 © 1130 788 444 
4 — 208 202 186 
5 153 148 


designs with p’ periods when residual effects are of interest, particularly 
with small numbers of periods. The efficiency factors for permanent 
effects are also high, provided that o7/o” is small. 

When direct efiects only are required it is better to use the basic 
balanced change-over designs, whether residual effects are present or not. 


Summary 


This paper deals with extra-period designs obtained by repeating the 
treatment pattern of the last period of any design in a general class of 
basic change-over designs. The basic designs are derived from Latin 
squares or incomplete Latin squares and satisfy certain conditions of 
balance which facilitate the estimation of direct and first residual effects. 

The extra-period designs have the useful property that the estimates 
of direct effects are orthogonal with the estimates of first residual effects. 

The construction and analysis of the extra-period designs are de- 
scribed. As with the basic designs, the estimates of error given by the 
method of fitting constants for direct and residual effects are biased. 
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TABLE 8 


PERCENTAGE EFFICIENCIES OF EXTRA-PERIOD DeEsiGNs RELATIVE TO 
Basic Designs WITH THE SAME NuMBERS OF PERIODS 


With oj/o? = 1 when 
p=pt+i1 kis With = 2| With = 5 
“ 3 + 5 large| and k large and k large 
Direct effects 
3 83 82 82 80 76 74 
4 — 92 92 91 87 83 
5 —- — 9 95 90 86 
6 —- — — 97 93 88 
Residual effects 
3 125 124 123 120 135 154 
4 — 110 110 109 119 134 
5 — — 106 105 112 124 
6 —- — — 103 108 117 
Permanent effects . 
3 152 148 146 140 125 115 
4 — 132 131 129 119 110 
5 — — 126 123 115 107 
6 —- — — 118 112 106 


Nearly unbiased estimates of the errors of residual effects can be obtained 
with certain types of design. 

The efficiencies and uses of extra-period designs are discussed in 
general terms. The designs are likely to be most useful when increased 
accuracy is required in the estimates of residual and permanent effects. 
They can, however, also be used to advantage 
(a) to extend for one further period an experiment which would otherwise 
have to be terminated, and 
(b) to provide balanced designs in blocks of two units. 


REFERENCES 


Cochran, W. G., Autrey, K. M., and Cannon, C. Y. [1941]. A double change-over 
design for dairy cattle feeding experiments. J. Dairy Sci. 24, 937. 

Cochran, W. G., and Cox, G. M. [1957]. Experimental Designs. New York: Wiley. 

Fisher, R. A., and Yates, F. [1953]. Statistical Tables for Biological, Agricultural, 
and Medical Research. Edinburgh: Oliver and Boyd. 


alt 
| | | 
\ 

, 

S. 

2S 

S. 

1e€ 
d. 


132 BIOMETRICS, MARCH 1959 


Lucas, H. L. [1951]. Bias in estimation of error in change-over trials with dairy 
cattle. J. Agric. Sci. 41, 146. 

Lucas, H. L. [1957]. Extra-period Latin-square change-over design. J. Dairy Sci. 
40, 225. 

Patterson, H. D. [1950]. The analysis of change-over trials. J. Agric. Sci. 40, 375. 

Patterson, H. D. [1951]. Change-over trials. J. R. Stat. Soc. B 13, 256. 

Patterson, H. D. [1952]. The construction of balanced designs for experiments 
involving sequences of treatments. Biometrika 39, 32. 

Patterson, H. D. and Lucas, H. L. [in preparation]. Tables of change-over designs 
with provision for first residual effects. 

Williams, E. J. [1949]. Experimental designs balanced for the estimation of residual 
effects of treatments. Aust. J. Sci. Res. A 2, 149. 

Yates, F. [1951]. Bases logiques de la planification des expériences. Ann. Inst. 
H. Poincaré 12, 97. 


« | 
ig 
4 
I 
GEE Q 
4) 
‘ht ~ 


QUERIES AND NOTES 


D. J. Finney, Editor 


135 QUERY: Replication of Non-Center Points in the Rotatable and 
Near-Rotatable Central Composite Design 


In reference to blocking arrangements for the rotatable and near- 
rotatable central composite design [Multifactor experimental designs 
for exploring response surfaces by G. E. P. Box and J. 8. Hunter, 
Annals of Mathematical Statistics 28: 195-241, 1957], I should like to 
know the following: 

In the event that blocking is necessary, and a decision is reached to 
replicate non-center points in addition to center points in blocks other 
than the axial block, is the number of points in the cube increased by 
the total number of replicates? If this is the case, is this adjusted value 
then used to determine the number of center-points and a@ required for 
orthogonal blocking? 


ANSWER: 


Suppose we are to make N observations in m blocks in accordance 
with a particular design which defines the levels at which k variables 
2, ,%2,°** , 2% will be held in each of N trials. If it is assumed that, 
when block differences are allowed for, a second-degree polynomial in 
the variables will provide an adequate representation of the true response 
n Within the region of interest, and if the response at the origin 
(x, = 2 = +++ = a = O) in the wth block is 8, , we can write the 
model for the uth set of experimental conditions as 


m k k k 
i=1 i=1 j=1 


w=1 


where z,, is a “dummy” variable taking the value unity for the n/, trials 
falling in the wth block and zero otherwise, where >5"_, nf = N. 

Suppose we have a design for which the sum of the x’s taken over 
all the observations is zero, i.e., 


1,2,---,b, 


u=1 
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and the sum of products for each pair of the 2’s is zero, i.e., 


N 

u=1 
then it is shown in general in the paper mentioned above that orthogonal 
blocking is achieved (that is to say all estimates of linear coefficients 
8; , linear X linear interaction coefficients 8;; , and quadratic coefficients 
8;; are completely uncorrelated with block comparisons) if within each 
block 

(i) the sum for each of the 2’s is zero, i.e., 


= 0 w=1,---,m, a=1,---,k. 


u 


(iii) the fraction of the total sum of squares for each variable x, 
contributed in each block is proportional to the number of 
observations in that block, i.e., 


A central composite design is one useful arrangement for fitting a 
second-degree polynomial in k variables. This design can be split up 
into two parts, a ‘“‘cubic” part and an “axial” part, which are made the 
basis of blocking arrangements. The “cubic” part of the design consists 
of a two-level factorial or a suitable (4)” fractional replicate of it such 
that one-factor and two-factor effects are all unconfounded with each 
other. In standardized coordinates in the k-dimensional space of the 
variables these trials are represented by n, the 2" points at the vertices 
of a cube with coordinates (+1, +1, --- , +1) or in the case of a (3)’ 
fractional replicate by 2‘~” of these points. Associated with these trials 
are a further n.. trials made at the center conditions. Together the 
nN, + No trials provide what we shall call the ‘‘cubic block.” The “‘axial” 
part of the design consists of the 2k additional points of a k-dimensional 
cross with coordinates (+a, 0, --- , 0); (0, ta, --- , 0); --- 
(0,0, --- , +a). Associated with this are Nao further at the 
center BRE Posse Together the n, + 7,9 trials provide what we shall 
call an “axial block.” If the cubic block is replicated r, times and the 


a 
(ii) the sum of products for each pair of the z’s is zero, i.e., 
2 
m n'w ~ w= 1,2,---,m, t= 1,2,--- ,k. 
2 
w u 
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axial block is replicated r, times there will be N = r.(n, + neo) + 
+ trials in all. 
It is clear that each part of the design satisfies conditions (i) and (ii). 
To satisfy condition (iii) we must also have 
r,2a° + Nao) 2a” Na + Nao 


= 1.e. 
+ Neo) Ne Ne + Neo 


This is the same as equation (87) in the reference, which is appropriate 
for a single replication of each part. It follows that replication of the 
cubic or axial blocks will not affect the value of a needed for orthogonal 
blocking. Unequal replication of the axial and cubic parts of the 
design will, of course, affect the value of a needed for “rotatability.” 
Often the cubic part of the design is itself split up into orthogonal 

sub-blocks. This is achieved by arranging that no two contrasts corre- 
sponding to first-order and second-order effects are confounded in the 
factorial part of the design. Using relations (i) and (ii) and (iii) it will 
be seen that orthogonal blocking is preserved if any sub-block is repli- 
cated with its appropriate center points any number of times. We 
cannot achieve rotatability with such designs, however, unless groups 
of blocks which together make up second-order rotatable designs are 
replicated equally. 

G. E. P. Box 

Statistical Techniques Research Group 

Princeton University 

Princeton, New Jersey 


136 QUERY: Significance of Difference Between Two Non-Inde- 
pendent Correlation Coefficients 


In testing the significance of the difference between two correlation 
coefficients based on the same sample and representing the correlation 
of a variable x, with y and of the variable x, with y, how large should n 
be before the z-distribution is appropriate, or is it ever appropriate? 


ANSWER: 


The difference between two independent correlation coefficients may 
be readily tested for significance using the 2z-transformation due to 
Fisher. As numerous: investigations have shown, the transformed 
correlations are approximately normally distributed about the popula- 
tion value, and treating them as normal is satisfactory for most practical 
purposes, even for fairly small samples. 
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When two population correlation coefficients are not independent— 
as for example when they are the correlations of two variates with a 
third—there is no exact test for the significance of their difference. 
Here the z-transformation is not effective, since it leaves the trans- 
formed values correlated. The most satisfactory test, due to Hotelling 
[1940], is fairly simple to apply and is therefore to be recommended. 

Suppose we wish to compare the correlations of y with x, and 2, , 
based on a sample of n observations. If we denote these correlations by 
r, and r, , and the correlation of x, with x, by r,. , then the test statistic, 
which is distributed as t with n — 3 degrees of freedom, is 


(n — 3)0 + 110) 
t= r) 2D 


where 


rel =1l—r — rie + 


| 


This test is strictly a conditional test, and is therefore seldom exact 
in application to actual data. It is valid for comparing the efficiency, 
as predictors of y, of the given sets of values of x, and x, , without 
reference to any population from which they might have been drawn. 
It cannot, however, validly be extended to drawing conclusions about 
future observed values of x, and x, . Treated as an unconditional 
test, it underestimates significance since it does not take into account 
the possible variation in x, and x, from sample to sample. 

Since, when correlations are being compared, we are usually interested 
in the relationships in the population from which the sample is drawn, 
the conditional nature of the test is a handicap. Nevertheless, the test 
may often be regarded as a satisfactory approximation, when used for 
drawing an inference about the population. 


KE. J. 

Institute of Statistics 
Universily of North Carolina 
Raleigh, North Carolina 
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The following papers were presented at meetings of the Fourth Inter- 
national Biometric Conference which was held in Ottawa from August 
28 to September 2, 1958.* 


R. E. COMSTOCK (University of Minnesota, St. Paul, Minne- 
527 sota). Dominance, Genotype-Environmental Interaction, and 
Homeostasis. 


Attention is drawn to the fact that data for estimation of genetic 
variance are often collected from material grown in only one stratum 
of the totality of environments for which the estimates are desired to 
apply. Such estimates are then confounded by variance arising from 
interactions of genotype and environment. 

The principal question examined is whether the ratios of estimates 
of additive and dominance variance are then unbiased. General logic 
and consideration of expressions for the components of interaction 
variance and the components of genetic variance lead to the conclusions 
that the upward bias in additive genetic variance estimates can easily 
(and may often) be larger than that of dominance variance estimates. 

The general conclusion is that data from an experiment conducted 
within a single stratum of environments cannot give conclusive informa- 
tion on genetic variance ratios. To set a maximum to the ratio of 
dominance variance to additive genetic variance only information on 
the magnitude of bias in the additive genetic variance estimate is 
required. 


K. Mather (University of Birmingham, Birmingham, England). 


528 The Balance Sheet of Variability. 


Provided that there has been no selection and in the absence of 
interactions between non-allelic genes, the genetic variability in the 
segregating generations descended from a cross between two homozygous 
lines appears in three components. First, there is the D component, 
depending on differences between individuals homozygous for allelic 


* Publication of these abstracts of papers of the Fourth International Biometric Conference 
was supported in part by a grant from the United States National Science Foundation. 
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genes, and second, there is the H component, depending on departures 
of heterozygotes from the average of the two corresponding homozy- 
gotes. Both of these contribute to the variation within each generation, 
measured round its own mean. ‘Thirdly, there is the departure of the 
generation mean from the mid-parent value, found as the average of 
the two parental homozygous lines and this is a function of S(h), the 
notation being that of Mather [1949a]. Since genetic variation measured 
round the mid-parent value is the sum of the variance round the genera- 
tion mean and the squared deviation of this mean from the mid-parent, 
it may be written in terms of D, H, and S*(h), each component taking 
a coefficient appropriate to the generation. These coefficients must 
sum to unity [Mather 1949b]. Thus if we write the variation of any 
generation as xD + yH + 2S*(h), thenz + y+2= 1. For example, 
in the F, , x = y = 0 and z = 1, while in the F, generation z = 3, 
y = 3, andz = } = }. In F; we must add Vir. = 4D + 3H and 
Vor; = 4D + $§H, these being the genetic variances between and within 
families, to obtain the genetic variance round the generation mean, 
which is 38(h). Thenz = #andz=P= 
js. Similarly for the S; generation obtained by sibmating in F, , 
= =} =}. 

The total variability as measured from the phenotypes will vary 
with the generation according to the relative magnitudes of D, H, and 
S’(h). But in a deeper sense the differences are no more than a re- 
distribution among the components. Genetically the total variability 
is constant as is indeed required by the permanence of genetic differ- 
ences. A similar relation holds for the components of variation depend- 
ing on genotype-environment interactions, and a similar balance sheet 
may be struck for them [Mather and Morley Jones 1958}. 

The situation is changed in detail, though not in principle, if selection 
is practised. Some of the gene differences will then be lost because 
certain alleles will be fixed in the sense that every member of the popula- 
tion will become homozygous for them. This will be reflected in a 
corresponding and permanent departure of the mean from the mid- 
parent and this departure will thus come to contain a further component 
depending on the d increments of the fixed genes as well as the h incre- 
ments of those still unfixed. 

Returning, however, to the case where selection is absent, the 
visible variability as measured by xD + yH + 2S’(h) is not the total 
of the variability contained in the cross. Indeed it may be only a 
small part of it. Let us consider a very simple example where the two 
parent lines differ in k unlinked genes, each contributing the same d incre- 
ment and all with h = 0. The maximum difference obtainable is where 
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all the increasing alleles are associated in one line and all the decreasing 
alleles in another. Fach line then departs by S(d) from the mid-parent 
so that the total variation is S’(d) = (kd)? = kD. Now in the F, from 
crossing two such lines, half the variation is present in the heterozygous 
state and will be reflected in the H and S*(h) components. The other 
half is present in differences between homozygotes and is measured by 
the D component. But half the total variability is }kD of which 4D 
constitutes only the fraction 1/k. (k — 1)/k of the variability is con- 
cealed, obviously as a result of balancing combinations of homozygous 
genes of the type AAbb, etc. The more genes there are the greater will 
be this concealed fraction (k — 1)/k. This concealed fraction is the 
homozygotic potential variability of Mather [1943]. With linkage the 
concealed fraction is increased where the repulsion phase is preponderant 
and decreased where coupling relations preponderate. It is not difficult 
to accommodate inequality of the h values of d in the calculation, but 
the simple case outlined will serve to illustrate the point. 

Two balance sheets of variability may thus be struck. The first 
relates to the immediately measurable variability which may be redis- 
tributed among the three components and this may be represented by 
the relation x + y + z = 1 in the expression xD + yH + zS*(h). The 
second balance sheet goes deeper, for it covers all the variability, 
potential as well as free, in the cross and it depends on the additional 
parameter k. In principle, k can be estimated in various ways by 
quantities generally denotable as K, the number of effective factors 
[Mather 1949a]. Such estimates are, however, difficult to arrive at 
reliably, and little attention has so far been paid to their calculation. 
Nevertheless, without reliable estimation of the number of effective 
factors, the balance of potential and free variability cannot be struck 
and the magnitude of ultimate response to selection cannot be calcu- 
lated. The derivation of reliable methods of estimation is long overdue. 
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O. KEMPTHORNE (lowa State College, Ames, Iowa). Bio- 
529 metrical Relations Among Relatives With Reference to Selection 
Theory. 


The present status of the theory of selection for a quantitative 
attribute, controlled by an unknown number of factors, is discussed, 
and the use of polynomial regressions is considered. The possible gains 
from the use of prediction formulae based on the use of both parents 
and of other relatives are considered. The development requires defini- 
tion and study of components of genotypic variability additional to the 
now common ones of additive variance, dominance variance, additive X 
additive variance, and so on. 


E. C. R. REEVE and J. C. GOWER (Institute of Animal 
530 Genetics, Edinburgh, Scotland, and Rothamsted Experimental 
Station, Harpenden, Herts., England). The Effects of Inbreeding 
When Selection Acts Against Homozygosis at One or More Loci. 


Selection against homozygotes at certain loci is probably of general 
occurrence in normally outbreeding species, and will reduce the rate of 
progress towards homozygosity when such a species is inbred. A 
realistic theory of inbreeding must, therefore, take into account such 
effects. 

Selection acting at a single locus will have both a direct effect on the 
speed of inbreeding at this locus, and also indirect effects, through 
linkage, on points near it on the same chromosome. The direct effects 
have been investigated theoretically for all the standard inbreeding 
systems, but mathematical formulation of the linkage effects leads in 
most cases to very large generation matrices. 

The simplest sib-mating case—equal selection against both homozy- 
gotes at one locus and its effect on a linked locus—which gives a 19 X 19 
matrix, has been analysed with the help of an electronic digital com- 
puter, and the results are here compared with the corresponding situa- 
tion under selfing. 

A given selection pressure causes a relatively greater reduction in 
inbreeding speed at the locus directly affected, under sib-mating than 
under selfing, but the linkage effects are dispersed more rapidly as we 
move away along the chromosome, under sib-mating. This means that 
selection at one point on a chromosome has an effect which is locally 
more severe, but is also more localised in terms of the length of chromo- 
some affected, under a milder than under a more rapid system of in- 
breeding. 

Analysis of the effects of linkage to two loci each carrying only 
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recessive lethals, in the two systems, suggests that selection against 
homozygosity at several points on the same chromosome will tend to 
sharpen the differences in inbreeding speed between different systems 
in the neighbourhood of these points, and to lessen these differences at 
loci in intermediate regions. 


C. C. COCKERHAM and F. G. MARTIN (North Carolina 
531 State College, Raleigh, North Carolina). The I.B.M. 650 and Its 
Uses in Selection Studies. 


The use of high speed computers in attacking problems in quantita- 
tive genetics.is being explored. A program mimicking a population 
evolving under mass selection has been written for the I.B.M. 650. 
The mating of selected parents, segregation, crossing over, and environ- 
mental variates were all stochastically determined with random numbers. 

Results were presented for variation in population size, selection 
intensity, dominance heritability, and reeombination. With as many 
as twenty loci tight linkages slowed down the rate of progress and 
caused some undesirable genes to be fixed. The effect of linkage was 
more pronounced with fewer parents saved. 


D. S$. ROBSON (Cornell University, Ithaca, New York). 
532 Genetical and Statistical Theory for Quantitative Inheritance 
Studies of Haploids. 


Variance component analysis of a panmictic haploid population is 
based upon a model conceptually identical to the ordinary, N-factor, 
linear statistical Model II. Variance components for the N main 
effects in the statistical model are added together to form the “additive 
variance component” in the genetic model; similarly, the sum of first 
order interaction components variance is called, in the genetic situa- 
tion, the “additive X additive variance component,” and so on. The 
corresponding variance component structure of a simple panmictic 
autopolyploid population may be viewed algebraically as a special case 
of the haploid structure. 

The variance component method of analysis is applicable only to 
breeding systems in which the expected genotypic distribution is sta- 
tionary; two different breeding systems, one with changing genotypic 
frequencies and one with constant genotypic frequencies, complement 
one another in providing estimates of first and second degree genetic 
parameters respectively. The analysis of such a pair of breeding sys- 
tems, recurrent backcrossing and full sib, is described here in detail. 
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533 F. KEITER Gerichtsanthropologisches Laboratorium, Hamburg, 
Germany). The Problems of Multifactorial Genetics in Man. 


A survey of the restricted possibilities in studying human genetics 
according to biometric principles is discussed. Continuous, normal 
distributions are the rule. Some exceptions are given. Dominance and 
interaction are problems of lesser concern in man, however. Twin 
research and the enormous amount of knowledge on morphological 
traits and their variability in man are the two points where anthropology 
can make contributions of special value. The interesting problem of 
changing manifestation of the same traits in different general conditions 
such as sex, age, absolute size of the trait, inconsistent auxiliary scales, 
is illustrated with examples. The variability of children is nearly 
uncorrelated with the difference between the values of the same trait 
in the parent if the midparent value is constant. This phenomenon is 
already known well enough for the case of race mixture in man; it holds 
also within a single population. A “family type” of physiognomy, 
contrary to popular belief, does not exist. All the facts can be well 
explained by the combination and variations of the different traits 
involved. Biometric genetics is the general foundation also for etho- 
logical (behavioral) traits. It is of much greater importance than is 
at present recognised in medical genetics. Monogenism is a special 
case of multifactorial conditions with the number of gene pairs being 
one. 


A. ROBERTSON (Institute of Animal Genetics, Edinburgh, 


534 Scotland). Designs for Measuring Heritability and Genetic 
Correlations. 


The first section is concerned with the optimum group size in herit- 
ability measurements in reference to a fixed total number of individuals 
measured. For a single classification, the optimum group size (n) is 
the reciprocal of the intraclass-correlation ¢. In a double classification 
of s sires and d dams, the optimum for the sire estimate (ndt ~ 1) is 
incompatible with that for the dam estimate (dt ~ 1). With the condi- 
tion that the two shall have the same sampling variance, the optimum 
appears to lie with d between 4 and 10 and n equal to 1/t(d + 1)'”. 
Small values of n are extremely inefficient in both cases as is d = 2 in 
the latter case. 

By throwing the analysis for two characters into the form of a group 
X character interaction, formulae are then developed for the sampling 
variance of the genetic correlations coefficient (r,) derived from the 
between group variance and covariance components for a single classifi- 
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cation. The solution is so far restricted to equal heritabilities of the 
two characters. The expression takes the simplest form when r, = 1, 
(the within group correlation coefficient), when we have 


1 — rs.e. (h’) 


V2 


where the standard errors refer to the estimates from the same data. 
This formula is independent of the degree of relationship between ° 
group members (parent-offspring, full sib or half-sib). Optimum 
structure is therefore the same as in the heritability analysis. 

The formulae are then used to discuss design of genotype-environ- 
ment interaction experiments in which r, is in the neighbourhood of 1 
and r,, is effectively zero. The optimum group size in each environment 
is then more than twice that for the estimation of heritability in that 
environment. 


s.e. (r,) & 


H. LeROY (Institute of Animal Breeding, Zurich, Switzerland). 
535 The Interpretation of Calculated Heritability Coefficients With 
Regard to Gene, Environmental, and Interaction Effects. 


The paper (1) gives general solutions for analysis of variance and 
regression analysis applied to problems of quantitative inheritance when 
all components except the general mean are random, and (2) shows the 
effects of gene, environment, and particularly their interaction on the 
structure of heritability estimates. 

The theory covers two-way and hierarchical classifications and the 
regression type of analysis. The findings are then applied to several 
common types of genetical experiment. 


K. GOODWIN, G. E. DICKERSON, and W. F. LAMOUREUX 

536 (Kimber Farms, Inc., Niles, California). An Experimental 
Design for Separating, Genetic and Environmental Changes in 
Animal Populations Under Selection. 


This paper describes a breeding technique which effectively sepa- 
rates genetic and environmental trends in animal populations under 
selection. It has been used in our flocks of White Leghorn chickens 
since 1954. 

The procedure basically depends upon matings which are repeated 
identically (except as mortality eliminates some) during two successive 
generations. From this basic scheme, plus a further series of matings 
which takes into account changes in maternal effects due to aging of 
the dams, it is possible to: a) make intra-year comparisons of progeny 
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from one-year-old dams of two sucessive generations to estimate the 
genetic change between the generations; b) make inter-year comparisons 
of progeny from pairs of full sisters, each one year old and mated to 
the same males, to estimate the environmental change between the two 
years; and c) make intra-year comparisons of progeny from pairs of 
full sisters mated to the same males, but differing in age by one year, 
in order to estimate the change in maternal effects with aging of the dam. 


J. A. NELDER (National Vegetable Research Station, Welles- 
537 bourne, England). The Estimation of Variance Components in 
Certain Types of Experiment on Quantitative Genetics. 


For experiments based on generations obtained from hybrids of pure 
lines, Mather has formulated a model in which sample variance and co- 
variance estimates have expectations which are linear functions of the 
population-variance components whose estimation is required. The 
method of estimation hitherto adopted has been by least squares, with 
assumption of equal variance for all sample estimates and no correla- 
tions between them. In practice these conditions are rare, and this 
paper describes an analysis based on weighted least squares to allow for 
unequal variances and non-zero correlations. The validity of weights 
derived from a simplified “normal” model is considered for variance 
genetic systems, and the efficiency of ‘“‘standard” estimates is given 
for certain cases. The new model is applied to published and unpub- 
lished data. Finally, iraplications of the analysis for the design of 
experiments of this type are discussed. 


A. W. NORDSKOG and O. KEKMPTHORNE (Iowa State 
538 College, Ames, Iowa). Importance of Genotype-Environmental 
Interactions in Random Sample Poultry Tests. 


So-called Random Sample Poultry Tests are currently popular in 
the U.S. as a means of demonstrating the productive characteristics of 
various strains of chickens offered for sale by commerical breeders. 
Most of these tests are conducted at a single location with entries in 
single or duplicate pens. If genotype-environment interactions are 
important then such single-location tests overestimate average genetic 
differences between strains. Moreover duplicate pen differences would 
not be a valid experimental error. Inconsistent performance of the 
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same entries from year to year in the same test location is a possible 
consequence of genotype-environment interactions. 

Results obtained from an Iowa multiple-location random sample 
test demonstrates that genotype-environment interactions may account 
for from 4 to 17 per cent of the total variance among pen means in age 
at sexual maturity, egg production, and egg size. If these same inter- 
actions can be considered typical of random sample tests in general, then 
the single-location test overestimates average genetic differences be- 
tween strains from 25 to 50 per cent. At the same time the variance of 
duplicate pen differences underestimates sampling error for testing 
strain differences from 30 to 50 per cent. 

Some aspects of the design of random sample tests aimed at estima- 
tion of the magnitude of these interactions as well as at the primary 
purpose of comparing entries are discussed. 


H. F. ROBINSON, C. C. COCKERHAM, and R. H. MOLL 

530 (North Carolina State College, Raleigh, North Carolina). Studies 
on Estimation of Dominance Variance and Effects of Linkage 
Bias. 


Extensive investigations on level of dominance of genes for yield in 
corn have provided estimates in the range of overdominance, but such 
interpretation is questioned due to possible effects of linkage bias. 
These results from F, generations of hybrid populations, derived from 
intercrossing inbred lines, are in contrast to the partial to complete 
dominance indicated in random breeding open-pollinated variety popu- 
lations. The possibility of greater biasing effects of linkage on the 
estimates of the dominance variance versus additive genetic components 
used in computing level of dominance is given. 

The paper presents preliminary experimental information from 
studies designed to evaluate the magnitude of the linkage bias where 
F, and advanced generation material of a hybrid population is back- 
crossed to inbred parents to produce the families for simultaneous 
testing. In these studies, previously designated Design III, consistent 
reduction was observed in estimated dominance variance in the Fs, 
compared to the F, generation. Due to similar decreases in estimates 
of additive genetic variance, the level of dominance in later generations 
where recombination opportunity had been provided was not appreci- 
ably changed from the dominance level of the F,. Changes in variance 
components following linkage dissipations are interpreted in terms of 
effects of both coupling and repulsion linkages. 
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G. A. MARTIN and A. E. BELL (Purdue University, Lafayette, 
540 Indiana). An Experimental Check on the Accuracy of Prediction 
of Response During Selection. 


With 12 highly inbred lines and a population synthesized from them 
as genetic material, population parameters needed for the prediction of 
response to high-low selection for body weight with inbreeding and 
changes in egg size, fecundity, and adult emergence during selection 
for body weight were estimated. These estimates were used to predict 
the course of events in replicated high-low selection over 15 generations 
with a breeding plan that imposed inbreeding at a rate of approximately 


3 per cent per generation. 


Realized heritability of the selected trait was slightly less than pre- 
dicted. Calculated coefficients of inbreeding exceeded the levels pre- 
dicted on the assumption that selection and inbreeding were inde- 
pendent. Genetic correlations of the unselected traits with body 
weight were highly transitory both before and during selection and were 
of little value for predicting association with the selected trait or change 
in the means of unselected traits. 

Effects of inbreeding were predicted on the assumption that heterosis 
is due to dominance effects of genes. The predicted values were in 
close agreement with those calculated independently in each line and 
generation. Inbreeding depression was most severe for fecundity, 
large for adult emergence, very small but definite for egg size, and of 
little consequence for body weight. 

Divergence of body weight was well predicted and highly repeatable, 
whereas directional. response to selection had lower predictability and 
repeatability. 


R. MASON, H. NICHOLSON, R. BOGART, and H. KRUEGER 
541 (Oregon State College, Corvallis, Oregon). Preponderance of 
Hybrid Losses or Negative Heterosis in Mouse Crosses. 


Four OSC strains of mice—A, V, O, and C—were mated together in 
all possible ways, producing 16 male and 16 female offspring groups. 
The characteristics measured in the offspring were: post-weaning 
growth, food consumption and efficiency of growth from 21 to 45 days 
of age, 45-day body weight, thyroid weight, micrograms thyroid per 
gram body weight and gram-minutes of survival in a sealed jar, number 
of mice born per litter, and 12-day litter weights of (F.) offspring pro- 
duced by random matings among the 16 offspring groups. 

The data were analysed for general and specific combining ability 
effects, for reciprocal effects; for sire-strain, dam-strain differences and 
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for sire-strain X dam-strain interactions; and for deviations of offspring 
from mid-parental averages. 

Maternal effects were outstanding for all of the growth and repro- 
ductive characteristics. The thyroid characteristics were inherited in 
an apparently additive manner and relatively free of maternal effects. 
Sex linkage of dominant genes for small size was apparent in the cross- 
bred mice of strain C. The physiological patterns in the parent mice 
were widely different and the crossbreeding frequently resulted in 
incompatibility between mother and offspring leading to negative 
heterosis, or the inability, in some cases, for the offspring to exploit 
their superior genotypes. It is concluded that for heterosis to be 
attained in mammals, where maternal effects can be profound, selection 
of parental material must be for similar physiological patterns. 


H. H. SMITH and D. 8. ROBSON (Brookhaven National Labora- 

542 tory, Upton, Long Island, and Cornell University, Ithaca, New 
York). A Quantitative Inheritance Study of Dimensions of 
Flower Parts in Tobacco. 


Measurements of corrolla tube length and lobe length were made 
over the period 1933 to 1955 on two inbred species of Nicotiana, N. 
sanderai and N. langsdorfii, and their derived generations through the 
F, and the second backcrosses. On the logarithmic scale environmental 
variances as measured in the nonsegregating generations were essen- 
tially constant through time and independent of genotype, though the 
effect of years (confounded with locations) was substantial and three 
was some indication of a genotype-year interaction. First degree 
genetic parameters including epistatic effects of triplets of non-allelic 
genes were estimated by a regression analysis of the weighted generation 
means. The model used was that of Fisher, Immer, and Tedin, extended 
through Kronecker multiplication to include all possible epistatic ef- 
fects. The results of the analysis suggest that epistasis occurs in this 
inheritance system; nevertheless, the simple non-epistatic, 3-parameter 
model fits remarkably well to the 13 generation means. 


A. F. PURSER (Animal Breeding Research Organization, Edin- 
543 burgh, Scotland). Experimental Use of Correction for Regression 
on a Second Character to Increase Efficiency of Selection. 


Both Rendel [1954] and Osborne [1957] have considered the efficiency 
selection of a character when correction is made for linear regression 
on a second character. They showed that, under certain conditions, it 
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is possible to make greater genetic progress by using a regression cor- 
rection than by selection of the first character alone. ; 

Similar ideas were developed quite independently and put into use 
in an experiment beginning in 1954 with a flock of hill sheep. The 
experiment was a two way selection program, with a control, to change 
body conformation of young lambs. Selection took place at normal 
castration time (i.e., about 2 months of age). A condition of selection 
was that body weight was to remain similar in all three lines. Cannon 
bone length was chosen as the main selection criterion and a correction 
was used for the phenotypic regression on body weight. 

The results of the first three generations of selection will be outlined. 
They show that in fact progress in changing cannon bone length is much 
greater when using the regression correction than when selection is for 
cannon length alone. 


C. K. CHAI (Roscoe B. Jackson Memorial Laboratory, Bar 
544 Harbor, Maine). Heritability of Iodine Metabolism in the 
Thyroids of Mice. 


A study of quantitative inheritance of thyroid activity in mice has 
been carried out. A mouse population was produced by intercrossing 
6 inbred strains, from which mice with full and half-sib relationships 
were produced. Determination of percentage of iodine output in the 
thyroids, which are closely related with rates of thyroid hormone secre- 
tion, has been made in the mice at ages of 120-150 days. This has been 
done by using radioactive iodine. Each mouse was injected subcu- 
taneously with J'* of 5-10 ue in 0.5 ml of saline. External thyroid 
counts were taken at different time intervals after the injections. 

So far a total of 246 mice has been determined. The percentage of 
iodine output in the thyroid of each mouse was computed by following 
the standard method. The mean percentage of iodine output in the 
thyroid was found to be 28.2 a day. An analysis of variance in the 
percentage output was made based on the different genetic relation- 
ships of the mice, and estimate of variance components associated with 
individual, dam, and sire effects was obtained. Following the method 
described by Osborne and others, we have obtained a value of 0.136 
for the heritability estimate of percentage iodine output in the thyroid 
glands of mice. Possible corrections and'sampling variance of estimate 


will be discussed. 
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545 M. J. R. HEALY (Rothamsted Experimental Station, Harpenden, 
Herts., England). Multivariate Growth Data. 


This paper consists of an interim report on some work being carried 
out by the author in collaboration with Dr. J. M. Tanner of the Institute 
of Child Health, University of London, and others. The basic data are 
believed to be unique; eight different measurements are available on 42 
boys and 388 girls at ages 0, 1, 2, 3, 4, and 5 years, and also as adults. 

The known techniques of multivariate analysis have not been much 
used in practice, largely due to the computational labor involved. This 
is now being overcome by the use of electronic computers, and it is 
important that these techniques should be tried out in practice so that 
they may be critically appraised and extended where necessary. The 
data described pose several problems for which known techniques seem 
appropriate. The results to date will be described and possible direc- 
tions for further work in this field will be discussed. 


M. B. DANFORD, H. M. HUGHES, and R. C. MeNEE (U. 8S. 

546 School of Aviation Medicine, Randolph Air Force Base, Texas). 
Multivariate Procedures in the Analysis of Repeated-Measure- 
ments Experiments. 


Repeated-measurement designs—periodic observations on the same 
set of subjects in each of several treatment groups—form a mixed model 
in which correlations often exist between observations on the same 
subject, and the testing of fixed effects if of prime interest. Exact 
multivariate tests are developed for these and applied to a set of actual 
data. Univariate tests, which have been shown to be valid under equal 
variance-equal covariance assumptions, are also applied to the example, 
along with tests of the latter assumptions. Results are compared with 
a third test of group-time interaction based on univariate analysis of 
slopes of individual time-curves. 


C. R. RAO (Indian Statistical Institute, Calcutta, India). 
547 Fiducial Bands and Goodness of Fit of Polynomial Regression 
When the Variables Are Correlated. 


When observations are taken on the same set of individuals at dif- 
ferent points of fime, as for instance, when the growth of the individuals 
over time or response to different doses of a drug, is studied, the analysis 
of variance methods of examining goodness of fit of polynomial regres- 
sion and other associated problems may not be valid because of the 
correlations between observations at different points. The appropriate 
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multivariate methods when no assumption is made about the nature of 
the correlation between the observations are developed in this paper. 

The specific problems considered are (i) fitting a polynomial curve 
of a given degree and examining its goodness of fit, (ii) providing fiducial 
bands to the average curve, and (iii) comparing the average curves 
under different experimental conditions. The solution to problem (ii) 
given by earlier writers has been improved to provide narrower bands. 


R. M. DeBAUN and A. M. SCHNEIDER (American Cyanamid, 
548 Stamford, Connecticut). Some Test of Assumptions in Multi- 
variate Analysis. 


In applied statistical work in the chemical industry, it is usually 
necessary to consider several response variables. An example is given 
of an experiment with two such variables, analysed by the multivariate 
analog of analysis of variance. Multivariate tests of homogeneity of 
error and of additivity are applied and it is shown that the usual assump- 
tions are not in fact satisfied. Validity of the analysis is restored by 
transforming the variables. 


G. 8. WATSON (Australian National University, Canberra, 
549 Australia). Some Recent Results in Chi-square Goodness of 
Fit Tests. 


The aim of this paper is to relate and extend some recent work on 
chi-square goodness of fit tests. There is no discussion of any problems 
which are specifically associated with more than one categorical variable. 
The main topics are the effect of estimation on chi-square and its parti- 
tionseand their relation to Neyman’s smooth goodness of fit test, and 
the effect of grouping a univariate distribution according to the disposi- 
tion of the sample on the distribution of the chi-square statistic. 


8. N. ROY and E. DIAMOND (University of North Carolina, 
550 Chapel Hill, North Carolina). Contributions to the Statistical 
Analysis of Categorical Data. 


This paper is based largely on the work of Roy, Mitra, 
Kastenbaum, and Diamond. For a single multinomial distribution 
n! (where >> p; = 1, and >> n; =_n (fixed), and i 
itself may be a multiple subscript), large sample tests "of the following 
types of hypothesis are discussed: (i) H):p; = p;(@, , , 4), 
—1> s], Gi) Ho: = --- , 6) with ,--- , 6) = 0, 
k = 1, --- ,t > 8, (iii) The parts of (ii) reversed, i.e., the hypothesis 
taken as the model (condition) and vice versa. Likewise, for the product 
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of several multinomial distributions, [n.;! 
where >> p,;; = p.; = 1, and > n,;; = n.; (fixed), and i and j may be 
multiple subscripts, large sample tests for hypothesis similar to (i), (ii), 
and (iii) for a single multinomial distribution are discussed. These 
hypotheses are then particularized to cover, for categorical data, a 
large class of questions, including a generalization of all those that are 
asked in the usual analysis of variance and in multivariate analysis. 
Asympototic powers (in the sense of Pitman) of these tests are discussed 
together with their physical implications and with the asymptotic inde- 
pendence of tests of several components of a complex hypothesis in each 
type of situation. The distinction between a “‘variate” and “a way of 
classification” is brought out in this discussion. An introduction is 
given to a theorem on several variates, some of them continuous and 
some categorical. 


N. F. GJEDDEBA:K (A/S Ferrosan, Copenhagen, Denmark). 
551 Contribution to the Study of Grouped Observations. IV. Some 
Comments on Simple Estimates. 


When data from a normal distribution are coarsely grouped we can 
form simple estimates of the mean and variance by referring observa- 
tions in each group to a single abscissa falling within the group 
boundaries. It is shown that for equidistant grouping, the simple 
estimate of variance with Sheppard’s correction is almost as efficient as 
the maximum likelihood estimate, at least for sample sizes up to 100. 
The maximum likelihood estimate is useful in very large samples or 
when the group intervals are not all equal. 


F. B. LEECH and M. J. R. HEALY (Rothamsted Experimental 
552 Station, Harpenden, Herts., England). Analysis and Interpreta- 
tion of Experiments on Growth Rate. 


Special problems of analysis and interpretation arise when repeated 
measurements are taken on the units of an experiment. This situation 
commonly arises in experiments on the growth rate of animals but also 
occurs in many other contexts. For convenience, the terminology of 
growth rates will be used here. Results of this kind are often analysed 
similarly to split-plot experiments, the successive readings taking the 
place of the sub-plots. This analysis is in general invalidated by the 
correlations between successive observations which in this context can- 
not be taken care of by randomization. 

In the simplest case to be considered, the effect of a treatment will 
be to cause the mean growth curve of treated animals to diverge from 
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that of control animals to an extent which is linearly related to the time 
elapsed since the treatment was applied—we can then completely de- 
scribe the treatment effect over the period considered as simply = lb. 
per week extra gain in weight. The problem then arises of how best to 
estimate x. Wishart in 1938 broke down each individual growth curve 
into its mean and linear, quadratic, ete. components, and implied that 
x would be estimated by way of an analysis of the linear components. 
However, if the treatments have been allotted by a proper random 
process, the true mean growth curves must coincide at the time at 
which the treatments were first applied, and it is then clear that a non- 
zero treatment effect will also cause a difference between the means of 
the two growth curves. The best estimate of x will thus be a weighted 
combination of the separate estimates derived from the means and the 
linear components, with weights depending on the variances and co- 
variances of these components. The paper describes the assessments 
of these weights, the added efficiency due to utilizing the estimate based 
on the means, and extends these ideas to more complex situations. 


P. G. HOMEYER, R. L. PRESTON, and W. BURROUGHS 

553 (Iowa State College, Ames, Iowa). The Design and Interpretation 
of a Bioassay for Detecting Small Quantities in the Presence of 
Interfering Substances. 


In connection with some cattle feeding experiments at Iowa State 
College several tissues of cattle were assayed for estrogenic activity of 
possible residues from oral feeding of small quantities of diethylstil- 
bestrol. Any possible extraction of these residues from the tissues was 
ruled out. 

The assay animals used were immature female mice and the measure 
of response was uterine weights. The main difficulty in determining 
an assay procedure was that tissues of cattle not fed this drug are known 
to contain substances that influence uterine weights. Also, other 
ingredients in diets producing normal mice gains in body weight might 
produce estrogenic activity. In addition, an assay procedure with high 
precision for estimating extremely small quantities was needed. 

After several preliminary assays, the procedure used for each tissue 
was to include a high constant level of the tissue in the diets used to 
estimate the unknown response curve. The diets used to estimate the 
standard response curve all included the same level of the tissue from 
cattle not fed diethylstilbestrol. The same three known levels of 
diethylstilbestrol were added to the diets for both standard and un- 
known. There were four pens of five mice on each level for both 
standard and unknown and a randomized block design was used. 
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The diet for each level of added stilbestrol (x;) for both standard 
and unknown contained a constant unknown level of estrogenic pro- 
ducing substances (C) found in normal tissues and the unknown con- 
tained an additional level (U) due to feeding diethylstilbestrol. The 
linear response curves for standard and unknown are: Ys = 9, + 
b,(a; + C), Yo = j + b.(x; + C + U). If the regressions are linear 
and parallel, then U = (gj, — g,)/b. 

The statistical and biological consequences of this and other models 
are considered. Results are given for several assays. 


W. T. FEDERER (Cornell University, Ithaca, New York). 


at Treatment Design and the Interpretation of Experimental Results. 


The planning of an experiment is one of the most important if not 
the most important phase in the successful and efficient conduct of an 
experiment. This important part of experimentation does not receive 
the attention that it should. One phase in planning experiments is the 
selection of treatments or the “treatment design.” On the average, it 
could be said that treatment design is as important as the combined 
effect of all other factors related to the successful completion of an 
experiment. This includes the selection of an experimental design, size 
and shape of experimental unit, number of replicates, control of biases, 
summary and interpretation of experimental results, ete. The com- 
parisons that can and should be made are determined to a large extent 
by the treatments included in the experiment. The treatment design 
determines whether or not evidence on specific questions can be ob- 
tained. 

A set of selected examples will be used to illustrate the importance 
of treatment design and the questions on which evidence is available 
given a specific set of treatments. The treatment design for each 
experiment must be considered individually and in light of the hy- 
potheses to be tested. 


J. H. EDWARDS (University of Birmingham, Birmingham, 
England). Some Estimators Relevant to 2 X 2 Tables and the 
Implications of Some Almost-Invariant Features of Normal 
Curves and Surfaces. 


555 


The simplicity with which significance tests can be done on 2 X 2 
tables has tended to eclipse the use of relevant estimators, without 
which sets of data cannot be combined without inventing or destroying 
information. Some estimators and approximate estimators, with their 
sampling variances, are briefly discussed. 
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Two approximate estimators are discussed which are almost in- 
variant in relation to the plane of dichotomy, and which in many 
circumstances may be said to show semantic invariance. They +3 
closely related to Yule’s coefficients Q and Y, the correlation coef- 
ficient r and its transform z. 

Consider the 2 X 2 table ae and let e’ = be/ad and m* = 
(a + b)(ec + d)(a+c)(b +d). We have the following table. 


Estimator Variance 
(be — ad)/m? 1/N 
be 1 1 
c d 1 1 1 1 
> 
Cc b ac 4 bd 
ate b+d (a+c) (b+d)° 
2 
tat 42) 
r = tanh z tanh r 
Q ~ tanh * 2 ~ tanh 
2 y 
Y x tanh tanh = = Y 
4 
—* 
x ~ +5 + 1) 


J. H. EDWARDS (University of Birmingham, Birmingham, 
556 England). Some Aspects of the Interpretation of Differences in 
the Internal Environment. 


In surveys relevant to environment hazards it is now common to 
regard tests of statistical significance as of more importance than esti- 
mators and to make estimation conditional on significance. It is shown, 
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with reference to a simple mathematical model—which closely simu- 
lates the available data on the association between serum cholesterol 
and coronary thrombosis—that 


1) The parameters defining the model are related by a simple equation. 
In the case of coronary artery disease and serum cholesterol, numer- 
ous surveys of various pairs of these indicate that the parameters 
are consistently related on the assumption of a causal relationship. 

2) Mean differences between populations may be very important in 
the explanation of morbidity differences, and yet may be too small 
to be revealed by significance tests on quite extensive surveys. 

3) If estimation is conditional upon statistical significance, the esti- 
mators will be biased due to the selection of concordant sampling 
effects. 


C. M. WANG (National Taiwan University, Taipei, Taiwan). 


ae Some Problems of Statistical Analysis of Fractions. 


1. Assumptions of normality and homoscedasticity of data under 
which the analysis of variance is valid for pertinent significance tests of 
variance-ratios such as the t-test, F-test, etc., are frequently not satisfied 
by the experimental data in fractions (or percentages). For the purpose 
of normalization of data and at the same time, stabilization of their 
variance, various operating methods of transformation or weighting are 
recommended by R. A. Fisher [1922 and 1930], Bartlett [1936 and 
1947], Bliss [1937], Cochran [1938, 1940 and 1943] etc., among whom 
Cochran [1943] proposed the inverse-variance-weighting procedure for 
analysis of variance on fractions (or percentages) based on unequal 
numbers. It appears, as he recognized, that the more closely the 
weights used approach the true weights, the smaller is the disturbance 
in levels of significance, on which, however, no investigation has yet 
been made. 

2. In the present paper, the writer has estimates of the disturbing ef- 
fects on the levels of significance due to eight kinds of transformation or 
weighting. Two of these are new; namely, inverse-variance-weighted 
angles (in degrees) and the weighted second-approximation of angles. 
Methods used here consist in estimating values of d, , d; , and d, as 
suggested by Box and Anderson [1954], the first indicating the effects 
of non-normality, the second the effects of heteroscedasticity, and the 
third being the effective degrees of freedom (h, and h,) converting 
factor somewhat like the combined effects of the first two. If there is 
no disturbance in levels of significance, all d, , ds , and d, will converge 
to unity and vice versa. 
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In addition, Tukey’s [1949] orthogonal mean-squares due to non- 
additivity in the mathematical model for variates are estimated and 
tested for significance, as disturbance from this source may invalidate 
significance tests. Watson’s [1936] and Salisbury’s [1943] experimental 
data are used for the purposes of Monte Carlo methods to elucidate the 
effects of disturbance from these three sources numerically. 

3. Although reactions of these two sets of data to the eight operations 
vary, it appears that the inverse-variance-weighting tends to flatten the 
frequency distribution of variates. 

4, From the point of view of simplicity, the two square-root trans- 
formations (~/a and +/a + 0.5) have superiority over all other opera- 
tions. There are additional advantages that square-root transformed 
data have an approach to unity of d, , d; , and d, no less than others and 
that this transformation is equivalent to partially weighting Vi by 
the inverse standard error of f so that the disturbance in levels of sig- 
nificance may decrease. Therefore, where the existence of homosce- 
dasticity in square-root transformed data has proved no less satisfactory 
than others, we are justified in using the square-root transformations 
for simplicity. If at the same time we compute d, and thereby the 
two effective degrees of freedom (h, = n,d, , and hy = nd,) to ascertain 
the level of significance, a relatively valid test for significance is possible. 
Numerical examples show that the square-root transformation has a 
higher superiority over other procedures in Salisbury’s data than in 
Watson’s data. 

5. For the particular data discussed, the transformed values only 
rarely exhibit non-additivity. 

6. Finally, a few words should be added that sums of square about 
a most likely value as origin should be given by the direct (if such it is) 
methods: Sum-of-squares about £ = >> (x — #)? where x = a random 
variate involved and # = the most likely value, or otherwise, the sum- 
of-squares may be either over- or under-estimated, if the alternative 
indirect formula: Sum-of-squares about £ = >> 2” — nz’ is employed. 


V. SAHLEANU and R. HOLBAN (Institute of Endocrinology, 
558 Bucarest, Rumania). Méthode Statistique Type pour 1|’Interpré- 
tation des Expériences Hématologiques. 


C’est un paradoxe que dans le domaine le plus quantifié de la mor- 
phologie, l’hématologie, les résultats expérimentaux sont les plus diver- 
gents. Nous croyons que l’explication peutétre trouvée dans |’emploi 
presque exclusif d’une statistique simpliste, arithmétique, dans le calcul 
des résultats; tandis que les éxigences scientifiques réclament l’emploi 
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d’une statistique Glaborée, probabiliste, et aussi d’un procédé standard. 

Nous proposons une “standardisation” trés simple des étapes et 
des méthodes de calcul, qui n’est par trés rigoureuse au point de vue 
mathématique, mais qui est trés maniable. Les points essentiels sont 
les suivants: 

1. Tous les résultats, provenant soit de mensurations, soit des 
dénombrements, soit d’évaluations procentuelles, sont exprimés par 
luers moyennes arithmétiques avec leur erreur-type calculées par la 
méthode employée pour le calcul des moyennes dimensionelles dans 
Vhypothése d’une distribution normale. 

2. Pour diminuer la variance d’échantillonage, chaque animal est 
son propre témoin. Les résultats obtenus sont les rapports entre le 
chiffre final et le chiffre initial, multipliés par 100. Chaque différence 
envers 100 représente la variation procentuelle, qui a une plus grande 
signification biologique que la différence absolue obtenue en soustrayant 
les valeurs, et qui “homogénise” les variation interindividuelles. Aussi, 
le calcul nous montre que, dans ce chiffre relatif, une erreur de détermi- 


nation donne par sa propagation un écart plus petit que l’erreur des 
différences. 


550 N. T. GRIDGEMAN (National Research Council, Ottawa, 
Canada). The Lady Tasting Tea. 


Fisher has described experimental designs suitable to the testing 
of the null hypothesis that two groups of items are sensorily indistin- 
guishable. His exposition is here re-examined in the light of amend- 
ments suggested by Neyman and by Wrighton. The following topics 
are discussed: (i) the sampling of hypothetical versus real populations, 
(ii) the notion of an empirical probability associated with sensory 
acuity, (iii) the construction of a mathematical model to accommodate 
a variety of sorting designs, (iv) the power function in testing contexts, 
(v) the role of randomization, and (vi) the problem of “inexact” accept- 
ance regions in discrete distributions. The conclusion is that Fisher’s 
approach to the testing of the lady’s tea-tasting claims and his general 
attitude to analogous tests are not easily gainsaid. 


560 W. LUDWIG (Zoological Institute, Heidelberg, Germany). 
Theoretical Calculations, Model Tests, and Empirical Proofs. 


The paper gives a resumé of some biomathematical problems in- 
vestigated with coworkers. Given a problem, at first we make a mathe- 
matical model, prove this by a practical one with peas, little balls, or 


158 BIOMETRICS, MARCH 1959 


random numbers, and, finding the calculations correct, we apply the 
formulae to natural observations (experimental or from nature). 

There are two stochastical questions. The one concerns Pearson’s 
coefficient of variation (VC) as a measure of variability. It should be 
constant, e.g. for the bodyweights of related species, or for the weight 
of body of a species and its body-parts, e.g..the second right leg. The 
first question we investigated on different species of cockroaches, the 
second on the waterstrider Gerris. Here the constancy of the VC is 
exactly realised in nature for a weight-interval 2000 :1. For the cock- 
roaches the interval is smaller. The constancy of the VC can be ex- 
plained theoretically by an Arley-process (cell division). 

The second stochastical problem concerns a generalization and a 
modified explanation of the frequencies of genera with x species (log- 
series). The first (A) was a technical one. How many spores of a 
bacillus must 1 ml suspension contain so that, if I pour a mixture of 
1 ml suspension and 6 ml agar in a petri-dish, the probability that in 
the cooled agar 2 spores lie at a distance less than x mm is smaller than 
1%. Application to Bacillus mycoides. (B) We imagine a large plain 
convex region, e.g. a table. Silver dollars rain slowly down. Each 
dollar falling beside the table or protruding over the edge, or over- 
lapping a dollar already lying, is removed. After some time the region 
is filled in the sense that each further dollar would overlap at least one 
lying dollar. Wanted are the distribution, mean, and variance of the 
numbers of dollars filling the region. (C) n circles of (constant) radius 
r are distributed in a convex region. Wanted is the minimum of n so 
that the region is filled. (D) Wanted is the minimum of n so that the 
region is just completely covered. (£) Wanted is the maximum of n so 
that the region is filled without overlapping. (F) In a plane n points 
are distributed randomly. We measure from each point the distance 
to its nearest neighbour. Questions: 1. Supposing each point is the 
centre of a circle of constant radius r, is the plane filled? 2. Supposing 
the plane is filled, what is the best estimate of r? Empirical examples. 

Finally a mathematical sketch about the evolution of sex is given: 
(plasmagenes-chromosomes-mitosis) +- (chromosomes-meiosis) leads to 
the beginning of sex-differentiation. 


A. S. FRASER (Haberfield, New South Wales, Australia). Simu- 
lation of Genetic Systems by Automatic Electronic Computers. 


It is possible to simulate segregation, recombination, fertilization, 
and selection on automatic electronic computers. This allows the 
construction of genetic models of a very high degree of complexity, 
and, by running a computer with a program based on such models, the 
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examination of the effects of selection, etc. on complex genetic systems. 
This method is illustrated for two types of model. ‘The “‘fixed’’ parame- 
ter model is one in which all the parameters of the genetic system are 
specified at the beginning of a run and do not vary during the run. The 
program which illustrates this is that described by Fraser [1957], in 
which the effect of linkage on the rate of advance under selection was 
examined. Several runs were made at different ages of population, and 
different degrees of linkage, comparisons between runs allowing evalua- 
tion of the importance of linkage. 

The ‘variable’ parameter model differs in that some of all of the 
parameters can vary during a run; the range of values between which 
a specific parameter can vary is specified, but the actual value for any 
individual is determined by the individual’s own genotype, e.g. domin- 
ance can vary from individual to individual. 

The results of runs of this type of program are evaluated in terms 
of the frequencies of the equilibrium values of the various parameters. 
This method allows fast determination of the high probability ranges 
of the genetic constants, e.g. dominance, epistasis, etc. 

The results obtained from these two analyses are discussed in 
relation to the general ability of this method. 


C. I. BLISS (Connecticut Agricultural Experimental Station, 
562 New Haven, Connecticut). Periodic Regression in Biology and 
Climatology. 


Periodic regression is applied here to cyclic phenomena in biology 
and climatology in which (1) the length of the cycle, such as a year or 
day, is determined independently of the response, (2) the observations 
are spaced evenly through the cycle, and (3) the number of replicates 
is constant at each interval. When the response (y) changes symmetri- 
cally through the cycle, the first harmonic or sine curve is defined 
by the mean response (a)) and two orthogonal regression coefficients, a, 
for the cosine u, and b, for the sine v, , as Y = a) + au, + b,v, , from 
which we can compute its amplitude and phase angle. When the curve 
is not symmetrical, the sine curve can be extended with additional 
terms for two, three, or more cycles in each fundamental period by 
classical Fourier analysis until the desired fit ,is achieved. 

The analysis of variance for deciding how many terms to retain in a 
Fourier curve and for determining its error is based upon the mathe- 
matical model for replicated regression. In effect, a Fourier curve is 
fitted to each replicate and the analysis determines in what respect 
these separate curves differ from replicate to replicate. Various aspects 
of the calculation are illustrated by the monthly mean temperatures 
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in New Haven over a 14-year period, the monthly iodine values of 
butterfat from five creameries in Alberta, and the electrical potential 
of an elm tree in eight three-day periods in August 1953. 

Both the number of terms in a periodic regression and the validity 
of its analysis depend upon the selection of a suitable unit for the re- 
sponse. The transformation to logarithms is applied to monthly data 
on two contagious diseases. The square root transformation for counts 
is illustrated with data on the hour of birth, where agreement with the 
assumed Poisson variation about a diurnal sine curve can be tested by 
x. The analysis of seasonal variation in the log-ED50 for a biocide 
or a drug is computed by maximum likelihood from all-or-none data 
with probits. A periodic response can be corrected for differences in a 
concomitant environmental factor by covariance, as illustrated by the 
adjustment for aperiodic humidity of the diurnal variation in the log- 
heat exchange of cows in an experimental barn. 

The precision of the constants for the first harmonic in a periodic 
regression is considered from two viewpoints. The first defines the 
variance of the statistics of a sine curve and the confidence limits of 
their parameters when each statistic is considered separately. The 
second defines a joint circular region within which any combination of 
the parameters for a, and }, is compatible with our observations at a 
given probability. 

Finer adjustments in periodic regression are examined with the 
monthly mean temperatures in New Haven. A correction for differences 
in the length of the month proved of minor importance relative to other 
errors in fitting a two-term Fourier curve. Seasonal changes in the 
variance through the year could be divided into two components, one 
representing differences between the observed monthly temperatures 
and their prediction by annual two-term Fourier curves, and the other 
differences between these predicted temperatures and the average two- 
term Fourier curve for all 14 years. For each source the log-variance 
changed periodically through the year in a sine curve, leading to esti- 
mated standard deviations for probability predictions and to weights 
for recomputing the average Fourier curve. 

A distinction is drawn between two objectives in periodic analysis, 
that of locating sources of variation and describing their characteristics, 
and forecasting, which must ordinarily be based upon the average over 
all replicates because of the unpredictable nature of long term trends. 
The summer temperatures contributed proportionately more to the 
weighted regression than the winter temperatures, a feature which may 
be potentially less desirable for climatological predictions than the 
simpler process of equal weighting through the year. In graphic tests 
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with rankits, the approximately random deviations from the yearly 
two-term Fourier curves proved to be satisfactorily normal, but when 
these were increased by the larger differences between the yearly and 
the average curves, the data suggest that seasonal departures from 
normality may modify probability predictions based upon an average 
Fourier curve. 

This paper has appeared as Bulletin 615 of the Connecticut Agri- 
cultural Experiment Station, June 1958. 


V. SAHLEANU (Institute of Endocrinology, Bucarest, Rumania). 
563 Perspectives de la Recherche Mathématique dans les Sciences 
Morphologiques. 


Nous considérons cinq directives de la recherche mathématique dans 
les sciences morphologiques, que nous exposons dans leur degré de com- 
plexité croissante. 

1. La biométrie descriptive correspond & l’assimilation des formes 
vivantes aux formes géométriques et 4 la détermination des grandeurs. 
La biométrie associée aux méthodes statistique permet |’étude de la 
variabilité, apporte sa contribution & |’établissement des normes et aux 
travaux de taxonomie et ouvre un champ vaste 4 la recherche des 
correlations. 

2. L’étude des rapports et des indices donne des indications concernant 
la forme, la structure, les proportions, l’harmonie, et |’équilibre de ia 
construction, l’importance relative des parties, etc. 

3. L’analyse géométrique des formes vivantes comporte: (a) l’analyse 
des propriétés des formes, pour trouver la “motivation” de leur présence 
dans le type d’organisation, leur “réle;’”’ (b) l’étude causale du déter- 
minisme physico-chimique de la form. Le probléme a des analogies avec 
ceux de la détermination des trajectoires ot des lignes de champ. 

4. Considérations géométriques concernant la croissance et la forme 
peuvent découvrir le sens biologique des quelques constatations ‘“de 
fait,’’ par exemple la structure ov la division cellulaire, la limitation de 
la taille des animaux terrestres, les modifications des formes et des pro- 
portions au cours de l’ontogenése et de la phyllogenése. 

5. La déduction mathématique en morphologie. Des hypothéses ex- 
primées comme équations différentielles peuvent étre confrontées avec 
les faits. Les problémes peuvent étre abordés ‘“formellement,”’ en cal- 
culant le phénoméne dans sa structure particulitre—en faisant abstrac- 
tion de la nature physique des facteurs mis en jeu; ot en calculant le 
phénoméne dans sa grandeur actuelle en tenant compte de la nature 
physique des forces et de leur intensité. Le domaine le mieux étudié se 
rapporte 4 la croissance des organismes. 
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V. SAHLEANU (Institute of Endocrinology, Bucarest, Romania). 


os La Cybernétique comme Point de Vue. 


Le point de vue de la “plastique” est celui qui s’oriente vers les 
aspects substantiels d ; objets analysés, par exemple la forme et la 
composition. On definit la vie par |’assimilation des substances et la 
maladie par la lésion. 

Le point de vue de I|’énergétique est celui qui met |’accent sur les 
capacités de travail. On congoit l’organisme comme une machine qui 
transforme |’énergie; la pathologie correspondante est celle dite “fonc- 
tionnelle.”’ 

Le point de vue qu’on peut nommer celui de la cybernétique met 
l’accent sur la qualité “d’information’”’ que possédent des petites quan- 
tités de substance ov d’énergie, par rapport 4 certains systémes organisés. 
Les influences d’ordre biologique et l’homéostasie sont analysés avec 
profit seulement de ce point de vue. En pathologie, les préoccupations 
corrélatives sont celles envers les troubles de coordination nerveuse ot 
humorale. 


J. YERUSHALMY and C. E. PALMER (University of Cali- 

565 fornia, Berkeley, California, and American Public Health Associa- 
tion, Washington, D. C.). The Strengths and Weaknesses of 
the Epidemiological Method. 


Investigations of etiologic factors in chronic diseases are often based 
on uncontrolled observations which are subject to considerable limita- 
tions. In infectious. diseases, the general acceptance of Koch’s postu- 
lates has contributed to the orderly and systematic identification of 
causative organisms in many diseases. 

In this paper, a parallelism of these postulates is attempted for 
chronic diseases. It is suggested that two essential types of evidence 
are involved in Koch’s postulates: 

I. The simultaneous occurrence, in the right sequence, of organism 
and disease, and 
II. The specificity of effect of organism on disease. 

Because in chronic diseases ‘“‘multiple causation’ is often encountered 
and especially because of the disturbing factor of self-selection, prob- 
lems in measurement, in selection of controls, and in testing for spe- 
cificity of effect arise. 

For purposes of discussion the following statements are suggested 
as a possible first approach to the development of acceptable guide 
posts in implicating a characteristic as an etiologic factor in chronic 
diseases: 
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I. The suspected characteristic must be found to occur more fre- 
quently in cases of the disease in question than in an appropriate 
group of persons without the disease. 

II. The disease must occur more frequently in persons possessing the 
characteristic than in an appropriate group of persons not pos- 
sessing the characteristic. 

III. The relationship between the characteristic and the disease must 
be tested for specificity by studying the relationship between the 
characteristic and other disease entities, and, if possible, between 
the disease and associated characteristics. The results of such 
study must indicate that similar relationships do not exist with a 
variety of characteristics and with many unrelated disease entities 
where such a relationship is not expected on physiological and 
pathological grounds. If such relationships are found to exist, 
the suspected characteristic cannot be said to be specifically 
related to the disease under question. 


566 8S. JABLON (National Research Council, Washington, D. C.). 
Clinical-Statistical Interaction in the Medical Follow-up Study. 


The clinician is oriented to the individual patient. He tends to 
syntiesize data in terms of the particular patient (e.g. into a set of 
diagnoses) and is acutely aware of the differences between individuals. 
He mistrusts statistics, often justifiably, as a lumping together of 
heterogeneous and diverse elements. The statistician seeks to gener- 
alize beyond an immediate set of patients. It is his task to find those 
common threads that link together the diversities of individuals. The 
success of collaborative effort in studies of the natural history of 
disease is dependent on the fruitful synthesis of these two points of 


view. Examples will be given from Government-sponsored follow-up 
studies. 


K. CARTER (Ames Company, Inc., Elkhart, Indiana). Clinical 
567 Trials and the Biometrician—A Pharmaceutical Company Medical 
Viewpoint. 


The decision to introduce a new therapeutic agent involves im- 
portant moral and economic considerations. Premature release may 
result in serious consequences for the patient, for the doctor, and for 
the pharmaceutical company. The decision should be based on objec- 
tive data related to important clinical criteria. The biometrician in 
the university or in the pharmaceutical company has a key position and 
a unique opportunity to insure that the clinical trial is carefully designed 
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and controlled so as to obtain objective data from which valid conclu- 
sions and important decisions can be made. Problems, examples, 
procedures, and opportunities related to this kind of work will be 
presented. 


D. MAINLAND (New York University College of Medicine, 
568 New York,. New York). Some Non-statistical Problems Met by 
Medical School Statisticians. 


Anyone who is appointed as a general consultant statistician in a 
medical school must expect to meet difficulties, which are often of 
psychological origin. Drawing on the experience of a medical graduate, 
research worker, and teacher, who subsequently became a medical 
statistician, this paper will explore the origins of some of the difficulties, 
and will discuss methods by which a young statistician, with inclination 
toward medicine, might try to protect himself against some of the 
dangers that he would meet in that field. 


8S. ROSIN (Zoological Institute, University of Berne, Berne, 
569 Switzerland). Biometrical Methods Applied to Problems of 
Blood-Group Distributions. 


For the statistical evaluation of a large strongly divided ABO blood- 
group material, methods are being developed which permit graphical 
homogeneity tests. 

The evaluations are based on the calculated gene-frequencies com- 
puted from the blood-group phenotypes and finally on two correlated 
variables. By an arc-cos transformation it is possible to find a homo- 
geneous coordinate system sufficient for practical purposes. In this 
system confidence circles can be related to the sample means, and as a 
result the statistical work on such large material can be greatly simpli- 
fied. Furthermore it is shown that with the aid of colors a meaningful 
presentation of the blood-group relationships may be indicated on a 
single geographical map. 

The methods have been applied to extensive Swiss data in a paper 
appearing in the Arch. Julius Klaus-Stift 31, 17-127, 1956. 


D. E. DAVIS (The Johns Hopkins University, Baltimore, Mary- 


_ land). The Behavioural Assumptions in Census Methods. 


The problems of estimating populations of wild animals have at- 
tracted the attention of statisticians for a couple of decades. The 
methods developed have necessarily made assumptions about the be- 
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haviour of the animals so that the population could be estimated. The 
present paper will explore our knowledge of behaviour in reference to 
these assumptions; it will ignore the strictly statistical assumptions. 

Methods of estimation may be divided into several types. (1) Di- 
rect counts assume that all are equally detectable. (2) Indices assume 
that ratios at several times and places are the same. (3) Rates of 
capture assume that all are equally trappable. (4) Recapture assumes 
among other things that all are equally retrappable. A number of 
examples from particular species illustrate the extent of applicability 
of these assumptions and the nature of future work. 


R. E. BLACKITH (Imperial College Field Station, Silwood Park, 
571 England). The Representation of Patterns of Growth in Grass- 
hoppers by Vectors. 


The patterns of growth in grasshoppers have been studied by 
bringing together the evidence afforded by several methods of multi- 
variate analysis. Ten morphometric characters were measured in 1,427 
adult insects, and the latent vectors of the residual dispersion matrix 
identified with independent modes of growth by comparison with 
factors rotated so as to give ‘simple structure.’ 

These vectors were then considered in relation to the discriminant 
functions which best separate known polymorphic types within the 
various species; sexual dimorphism can be expressed by nearly parallel 
vectors in different species, whereas the differences of shape associated 
with colour polymorphism can be shown to reflect partly size differ- 
ences, partly distinctions of form which are not consistent from one 
genus to another. 

The discriminant functions form a network in the 10-dimensional 
space delineated by the latent vectors as axes, this network commonly 
resolving itself into a generalised distance chart whose smaller dimen- 
sionality reflects the number of fundamentally distinct modes of growth 
actually elicited (the number of distinct polymorphic contrasts ob- 
served), whereas the number of latent roots distinguishable from an 
isotropic residue represents the potential number of polymorphic con- 
trasts of which the insects are, in principle, capable. 

The analysis of the latent vectors may show that certain modes of 
growth are independent even though a generalised distance chart dis- 
closes that, in the conditions under which the insects live, more than 
one such mode may be needed to give rise to a particular polymorphism. 
Some implications of such correlated modes of growth for the study of 
allometry are noted. 


‘ 
~ 
2 
- 
on 


166 BIOMETRICS, MARCH 1959 


E. R. RICH (University of Miami, Coral Gables, Florida). The 
572 Role of Sexual Behaviour in the Population Dynamics of Tribo- 
lium. 


In a population of such an organism as the flour beetle, Tribolium, 
the probability that a given female will produce exactly one egg in a 
specified interval of time has been shown to be influenced by the history 
of the female and the prevailing environmental conditions. Age and 
status as to whether a virgin or not are obvious factors. There are 
strong indications that the density of the population affects this pro- 
bability and one of the many ways in which this factor may act is 
through the time since the last copulation. This has been implicated 
in earlier work and here, through a series of experimental steps, the 
question of the pattern of the initiation of egg production is examined. 

Based on assumptions of random movement, the frequency of con- 
tacts would be expected to be an increasing linear function of the 
density. This was confirmed by a series of observations. The logical 
extension of this idea to frequency of copulation was not borne out by 
the observations and in fact, for the moderate densities studied the 
copulation frequency appeared to be independent of the number of 
males present. Some aspects of frequency, duration, and effectiveness 
of copulation are discussed. 

A model for the initiation of egg production which is based on the 
classical occupancy problem is presented. Here the number of balls 
being distributed among the compartments is compared to the number 
of copulations being distributed as a Poisson variable dependent on the 
number of males. and females, time, and the propensity to copulate. 
This model is subjected to experimental test with inconclusive results. 
Some speculations are advanced concerning the frequency of copulation 
as a factor in the outcome of interspecies competition. 


573 F. 8. ANDERSEN (Statens Skadedyrlaboratorium, Springforbi, 
Denmark). Competition in Populations of One Age-Group. 


Based on the rather voluminous literature on competition in popu- 
lations consisting of a single age-group, the following mathematical 
models are given: 

1. When larval cannibalism is not involved, the fraction surviving 
to the imaginal stage is often linearly related to the initial number of 
larvae. 

2. In a case of larval cannibalism, the logarithm of the surviving 
fraction is linearly related to the initial number of larvae. 

3. In a case of differential mortality of two genotypes composing a 
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population, the mathematical model is graphed as a straight line for the 
total surviving fraction, and two parabolas concave towards it for the 
surviving fraction of the two genotypes, the abcissa being the initial 
number of larvae. Also the ratio between the surviving number of the 
two genotypes forms a straight line against this abcissa. 

4. In the case of competition between larvae, the number of eggs 


. per female (and the female weight) is linearly related to the reciprocal 


of the initial number of larvae, and 

5. In the case of competition between adults, the fecundity is often 
linearly related to the reciprocal of the number of adults, but 

6. In some grain and seed pests the equation is complicated by the 
introduction of a linear term for mutual interference. 

7. In some rare cases the sex ration is influenced by the density. It 
is linearly related to the density, 


a) in Tineola with chromosomal and random determination of the 
sex, 

b) in a hymenopterous parasite with chromosomal, but non-random 
determination of the sex, 


c) in Moina (Cladocera) with phenotypical determination of the sex. 


Critical reviews have not yet been made of the literature dealing 
with the effect of density on the duration of development and on animal 
movements. 

A hypothetical explanation is given of the beneficial effect of larval 
crowding in some Lepidoptera. 

Populations consisting of a single age-group are thought to be 
common in nature, but so are those composed of many age-groups, and 
future investigations must deal with competition in such populations 
as well as the distribution in time of the effect of density in populations 
consisting of a single age-group. 
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Région Belgique et Congo Belge 
Activités 1958 de la Société Adolphe Quetelet 


16 avril 1958 4 17 h. 30 (Fondation Universitaire): Conférence du Dr. 
W. J. YOUDEN du National Bureau of Standards: “Experimental 
Designs in Physics and Chemistry.” 

9 septembre 1958 (Institut d’Hygiéne et de Médecine Sociale de 
l'Université libre de Bruxelles) : 

Deuxiéme Journée Biométrique: 

R. A. BRADLEY 

Recent research on statistical problems in subjective testing. 

S. C. PEARCE 

Some recent applications of multivariate analysis to data from fruit 

trees. 

S. Ch. M. LEDERMANN 

Alcoolisation d’une population et mortalité. 

H. GRIMM 

Einige Probleme der Bakterienzahlung. 

D. J. FINNEY | 

Plant selection for yield improvement. 

F. B. LEECH 

Problems arising in the planning of surveys of animal diseases. 

N. MANTEL 

Preclinical cancer chemotherapy. 

L. MARTIN, J. FRANKSON et Anne LENGER 

Etude biométrique de l’épreuve d’hypoglycémie en clinique. 

10 septembre 1958 (Université libre de Bruxelles, Solbosch) : 

Symposium III sur la Biométrie et ses applications aux sciences phar- 
maceutiques: 

Sir Ronald A. FISHER 

Biometrical and statistical methods in pharmaceutical sciences. 

A. LINDER 

Applications pharmaceutiques de quelques méthodes statistiques 

élémentaires. 
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J. BERKSON 
The measure of error in bioassay. 
L. MARTIN et Ch. LIEBER 
Essai clinique d’anticholinergiques de synthése. 
R. BONTEMPS 
Résolution spectrophotométrique U. V. de mélanges binaires dans le 
cas d’interférence—Etude biométrique et expérimentale. 


British Region 


A meeting held on October 29, 1958 was devoted to multivariate 
analysis. The following papers were presented—R. L. Anderson: 
Recent advances in multivariate analysis; M. J. R. Healy: Multi- 
variate growth data; 8. C. Pearce and D. A. Holland: Applications of 
multivariate techniques to data from fruit trees. 

At the general meeting held on December 17, 1958, the following 
officers were elected for 1959: President—J. O. Irwin, Secretary— 
C. C. Spicer, Treasurer—P. A. Young, Committee members—S. Peto, 
M. Weatherall. After the business meeting the following papers were 
read and discussed: P. Armitage: An examination of some experimental 
cancer data in the light of the one hit theory of infectivity titrations; 
C. D. Kemp: Regression techniques for estimating the leaf area of 
grasses. 


ENAR 


The following officers have been elected for 1959: President—J. 
Cornfield; President-elect—W. T. Federer; Secretary-Treasurer—T. W. 
Horner; Committee members—W. G. Cochran, B. Harshbarger. 


Japan 


The Japanese members of the Society met in Tokyo on October 14, 
1957 and on April 8, 1958. The following papers were presented— 
T. Okuno and H. Ohata: Statistical analysis of variety trials on wheat 
and barley; T. Mineshita and C. Asano: Bioasay of small amounts of 
insulin; C. Asano and M. Arita: Optimum planning for the effects of 
drugs; M. Nei and K. Syakudo: Genetic variance and heritability in 
partially allogamous plants; T. Ishikawa and K. Takahashi: Estimation 
of relative potencies of vaccines; N. Kuzuya, K. Takahasi and T. 
Kashiwagi: Consistency of factor analysis results in six successive years; 
M. Masuyama and T. Okuno; Optimality of Latin-, Youden- and 
Skrikande-square designs; S. Yushikawa: Analysis of sensory test re- 
sults; S. Nakaganii: Statistical analysis of blood-sugar values; T. 
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Ichihara: Discriminant functions applied to external measurement on 
the fin whale. 

English abstracts of these papers can be obtained from the National 
Secretary for Japan, Mr. M. Hatamura, National Institute of Agri- 
cultural Research, Nishigahara, Kita-ku, Tokyo, Japan. 


MEETING OF THE BIOMETRIC SOCIETY, E.N.A.R. 


The Biometric Society (E.N.A.R.) will meet jointly with the Insti- 
tute of Mathematical Statistics and the American Statistical Associa- 
tion (Section on Physical and Engineering Sciences) at the School of 
Public Health, University of Pittsburgh, on Thursday, Friday, and 
Saturday, March 19-21, 1959. Three sessions have been arranged by 
the Biometric Society on Uses of Life Table Analysis in Health Research, 
Uses of Statistical Methods in Epidemiology, and Bioassays. These and 
other sessions will be jointly sponsored by the participating societies. 
Dr. J. O. Irwin will present an invited address, Biometrical Methods, 
Past, Present, and Future, at 8:00 p.m. on Thursday, March 19. An- 
nouncements regarding the entire program have been sent to members 
of the Biometric Society. 


WNAR ANNUAL MEETING 1959 


The WNAR annual meeting will be held on June 15-16, 1959 in 
San Diego, affiliated with the annual meeting of the Pacific Division 
of the AAAS. Visitors from other Regions are cordially invited to attend. 
Further information on accommodations may be obtained from: Dr. 
George A. Lindsay, Natural History Museum, Balboa Park, San Diego. 

The final program will be available about April 15 from the Sec/ 
Treas., WNAR, The Biometric Society, U.S. Naval Radiological De- 
fense Lab., San Francisco 24, California. 

Tentative session titles are: Fisheries Statistics, Medical Statistics, 
Genetics and Agriculture, and Contributed Papers. 

Abstracts of contributed papers should be sent to the program 
chairman: Dr. William F. Taylor, School of Public Health, University 
of California, Berkeley 4, California. 


CHANGES IN MEMBERSHIP 
(October 1958—January 15, 1959) 
Changes in Address 


Prof. Ishver 8. Bangdiwala, Box 1648 University Station, University 
of Puerto Rico, Rio Piedras, Puerto Rico 
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Mr. Rainald K. Bauer, Breitestr. 82, Krefeld, Germany 

Dr. George E. P. Box, Statistical Techniques Group, 167 Nassau Street, 
Princeton, New Jersey, U.S.A. 

Dr. A. E. Brandt, 2010 N. W. 14th Avenue, Gainesville, Florida, U.S.A. 

Dr. John W. A. Brant, 107 Highland Road, Southward Hills, York, 
Pennsylvania, U.S.A. 

Dr. Leroy S. Brenna, c/o The Texas Company, Research Laboratories, 
Beacon, New York, U.S.A. 

Dr. Samuel H. Brooks, 21903 W. Callado Way, Topanga, California, 
U.S.A. 

Mr. Lyle D. Calvin, 240 Extension Hall, Oregon State College, Cor- 
vallis, Oregon, U.S.A. 

Mr. Dennis Cooke, Glasshouse Crops Research Institute, Worthing 
Road, Rustington, Littlehampton, Sussex, England 

Mr. Norman Derek Edge, 15 Nelwyn Avenue, Hornchurch, Essex, 
England 

Dr. J. Gani, Department of Mathematical Statistics, Cohemiin Uni- 
versity, New York 27, New York, U.S.A. 

Mr. J. J. Grimshaw, “Highways, ”” 27 Musley Lane, Ware, Herts., 
England 

Miss Joan Gurian, 2400 Colston Drive, Silver Springs, Maryland, U.S.A. 

Dr. Richard J. Henry, 12330 Santa Monica Boulevard, Los Angeles 25, 
California, U.S.A. 

Dr. John R. Kinzer, Ohio State University, Columbus 10, Ohio, U.S.A. 

Dr. James L. Leitch, UCLA School of Medicine, Department of Nuclear 
Medicine and Radiation Biology, 10875 LeConte Avenue, Los 
Angeles 24, California, U.S.A. 

Mr. Francois-Michel Levy, 178 rue de Courcelles, Paris, France 

Dr. Ferdinand Merz, Fruhlingsstr. 20, Rimpar b. Wurzburg, Germany 

Dr. Eric Michalup, Apartado 6448, Caracas, Venezuela 

Dr. Paul D. Minton, 7317 Wild Valley Drive, Dallas 31, Texas, U.S.A. 

Prof. Frederick Mosteller, Department of Statistics, Harvard Univer- 
sity, 2 Divinity Avenue, Room 311, Cambridge, Massachusetts, 
US.A. 

Mr. Louis Munan, 628 A Street, N.E., Washington 2, D. C., U.S.A. 

Dr. Anita E. Rapoport, 3373 Rowena Avenue, Apartment 1, Los Angeles 
27, California, U.S.A. 

Dr. Ch. L. Rumke, c/o Vrye Universitat, 29 Prins Hendrikstraat, 
Amsterdam, Netherlands 

Dr. G. T. Scarascia, Divisione Biologica, C.N.R.N., v. le Belisario 15, 
Rome, Italy 

Dr. Walter Schreiner, Rethmar uber Lehrte, Bundessortenamt, Rosen- 
str. 11, Schnde/Hannover, Germany 
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Dr. Howard A. Schuck, Alaskan Air Command, Deputy for Operations, 
APO 942, Seattle, Washington, U.S.A. 

Dr. Francesco Sella, via Marchiondi 7, Milano, Italy 

Dr. A. R. Sen, Tocklai Experiment Station, Cinnamara P.O., Assam, 
India 

Mr. P. H. Smith, 40 Brooklyn Gardens, Cheltenham, Gloucestershire, 
England 

Prof. G. W. Snedecor, 1113 Bangor Street, San Diego 6, California, 
US.A. 

Mr. P. Sprent, East Malling Research Station, East Malling, Nr. 
Maidstone, Kent, England 

Mr. D. A. Sprott, Department of Mathematics, Waterloo College, 
Waterloo, Ontario, Canada 

Mr. M. van Albada, Beatrixlaan 19, Bennekom (Gld.), The Netherlands 

Mr. Raul Vargas, Department of Sociology, University of Chicago, 
Chicago 37, Illinois, U.S.A. 

Mr. D. R. Westgarth, The Forge, Brightwell Baldwin, Nr. Watlington, 
Oxon, England 

Dr. E. J. Williams, Cedar Point, Mackinac Island, Michigan, U.S.A. 

Dr. Lois M. Zucker, Laboratory of Comparative Pathology, R.F.D., 
Stow, Massachusetts, U.S.A. 


New Members 


At Large 


Dr. Elinar Pedersen, The Cancer Registry of Norway, The Norwegian 
Radium Hospital, Oslo, Norway 

Mr. Al-Salihi Rashid, Ahdamaih, Haibt Khtoon 34/23, Baghdad, Iraq 

Dr. Olav Reiers¢l, Matematisk og Mekanisk Institutt, Universitetet, 
Blindern, Oslo, Norway 


Belgian Region 
Mr. Pierre T. A. Dagnelie, 16 Avenue d’Avril, Bruxelles, Belgium 


British Region 

Mr. Munir Ahmad, Department of Statistics, Aberdeen University, 
Meston Walk, Old Aberdeen, Scotland 

Mr. A. E. Maxwell, Statistics Section, Maudsley Hospital, Denmark 
Hill, 8.E. 5, England 

Prof. F. L. Milthorpe, University of Nottingham, School of Agriculture, 
Sutton Bovington, Loughborough, England 
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Dr. M. W. Parkes, Pharmacology Laboratory, Roche Products, Ltd., 
Welwyn Garden City, Herts., England 


Eastern North American Region 


Mr. Russell B. Adams, 88 Arthur Avenue, 8.E., Minneapolis 14, Min- 
nesota, U.S.A. 

Mr. James T. Baird, Jr., Department of H.E.W., Washington 3, 
D. C., U.S.A. 

Mr. Leo Allen Bernat, 688 Holly Avenue, St. Paul 4, Minnesota, U.S.A. 

Mr. Richard S. Bingham, Jr., The Carborundum Company, Research 
and Development Division, P. O. Box 337, Bldg. 1-2, Niagara Falls. 
New York, U.S.A. 

Dr. Luther S. Bird, Department of Plant Physiology and Pathology, 
College Station, Texas, U.S.A. 

Mr. George J. Blakemore, Jr., 3 Pooks Hill Road, Bethesda 14, Mary- 
land, U.S.A. 

Dr. R. Darrell Bock, Psychometric Laboratory, University of North 
Carolina, Chapel Hill, North Carolina, U.S.A. 

Mr. Calvin C. Bolze, Midwest Research Institute, 425 Volker Boule- 
vard, Kansas City, Missouri, U.S.A. 

Mr. Edwin B. Bridgforth, Department of Preventive Medicine, Vander- 
bilt Medical School, Nashville 5, Tennessee, U.S.A. 

Mr. Thomas A. Budne, 3 Dunster Road, Great Neck, New York, U.S.A. 

Mr. Kenneth A. Busch, 1208 College Place, Raleigh, North Carolina, 
U.S.A. 

Dr. John C. Bailar, III, Yale University School of Public Health, 310 
Cedar Street, New Haven, Connecticut, U.S.A. 

Dr. Robert C. Carter, A. H. Department, Virginia Polytechnic Insti- 
tute, Blacksburg, Virginia, U.S.A. 

Miss Betty Caugherty, 1213 Carlisle Street, Natrona Heights, Pennsyl- 
vania, U.S.A. 

Dr. J. B. Chassan, Statistics Section, St. Elizabeths Hospital, Washing- 
ton 20, D. C., U.S.A. 

Mr. Robert T. Chatterton, Jr., Department of Animal Industries, Uni- 
versity of Connecticut, Storrs, Connecticut, U.S.A. 

Mr. David B. Christian, 121 Bastogne Avenue, Mishawaka, Indiana, 
U.S.A. 

Dr. G. H. Clark, Box 580, Harrow, Ontario, Canada 

Dr. James W. Cobble, Animal and Dairy Husbandry Department, Uni- 
versity of Rhode Island, Kingston, Rhode Island, U.S.A. 

Dr. Raymond O. Collier, Jr., 1994 N. Fairview Avenue, St. Paul 13, 
Minnesota, U.S.A. 

Miss Helen J. Coon, 102-B Rodman Road, Aberdeen, Maryland, U.S.A. 
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Mr. Tiberius Cunia, 5562 Basile Patenaude Pl., Montreal, Quebec, 
Canada 

Dr. M. Bryan Danford, Department of Biometrics, USAF School of 
Aviation Medicine, Randolph AFB, Texas, U.S.A. 

Dr. Richard F. Davis, Dairy Department, University of Maryland, 
College Park, Maryland, U.S.A. 

Dr. Richard J. DeGray, 15911 Scottsdale Boulevard, Shaker Heights 
20, Ohio, U.S.A. 

Mr. Richard L. Deininger, Bell Telephone Laboratories, Murray Hill 
Laboratory, Murray Hill, New Jersey, U.S.A. 

Mr. Wm. J. Dewey, 4102 Meadowbrook Lane, St. Louis Park 26, Min- 
nesota, U.S.A. 

Dr. Earl L. Diamond, Department of Biostatistics, Box 229, Chapel 
Hill, North Carolina, U.S.A. 

Miss Martha W. Dicks, Animal Industries Department, University of 
Connecticut, Storrs, Connecticut, U.S.A. 

Mr. James G. Dickson, Department of Plant Pathology, University of 
Wisconsin, Madison 6, Wisconsin, U.S.A. 

Mr. Victor A. Dirks, Agronomy Department, State College, Brookings, 
South Dakota, U.S.A. 

Dr. Tom Donnelly, Miller Hall, Chapel Hill, North Carolina, U.S.A. 

Miss Irene L. Doto, Kansas City Field Station, USPHS, 3900 Eaton 
Street, Kansas City 3, Kansas, U.S.A. 

Mr. Bruce A. Drew, 205 South 4th Avenue, Hopewell, Virginia, U.S.A. 

Dr. John W. Dudley, Field Crops Department, North Carolina State 
College, Raleigh, North Carolina, U.S.A. 

Dr. Acheson J. Duncan, Johns Hopkins University, Baltimore 19, 
Maryland, U.S.A. 

Mr. Harvey Eisenberg, 100 Center Street, Neptune, New Jersey, U.S.A. 

Mr. Magdi El-Kammash, P. O. Box 816, Chapel Hill, North Carolina, 
US.A. 

Mrs. Polly Feigl, 4720 Cedar Avenue, Philadelphia 43, Pennsylvania, 
USS.A. 

Dr. Vern L. Felts, 224 Stock Pavilion, College of Agriculture, Madison, 
Wisconsin, U.S.A. 

Mr. Donald J. Fisk, 235 Fellsway East, Malden 48, Massachusetts, 
U.S.A, 

Dr. Mare F. Fontaine, Forge Brook Meadows, Fishkill, New York, 
US.A. 

Dr. Stewart Hampton Fowler, Department of Animal Industry, Louisi- 
ana State University, Baton Rouge 3, Louisiana, U.S.A. 

Dr. Edmund A. Gehan, Biometry Branch, National Cancer Institute, 
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National Institutes of Health, Bethesda, Maryland, U.S.A. 

Mr. Robert L. Goldsmith, Sylvania Electric Products, Inc., Towanda, 
Pennsylvania, U.S.A. 

Prof. Leo A. Goodman, 1126 E. 59 Street, Chicago 37, Illinois, U.S.A. 

Mr. H. Stanley Graf, 58 Winans Drive, Yonkers, New York, U.S.A. 

Prof. David A. Grant, Department of Psychology, University of Wis- 
consin, 600 North Park Street, Madison 6, Wisconsin, U.S.A. - 

Mr. Richard A. Greenberg, 40 Greenfield Street, Hartford 12, Connecti- 
cut, U.S.A. 

Mr. Edwin F. Grey, Quartermaster, Research and Engineering Center, 
Natick, Massachusetts, U.S.A. 

Dr. Warren H. Gullen, 2308 N.E. Marshall, Minneapolis, Minnesota, 
US.A. 

Mr. Ivan K. Hadlock, P. O. Box 348, Phoebus, Virginia, U.S.A. 

Miss Glen Rae Hanemann, 635 North Pierce Street, New Orleans 19, 
Louisiana, U.S.A. 

Dr. Lester E. Hanson, Peters Hall-120, Institute of Agriculture, St. 
Paul 1, Minnesota, U.S.A. 

Dr. Walter Robert Harvey, Biometrical Services, ARS, Agricultural 
Research Center, Beltsville, Maryland, U.S.A. 

Miss Gweneth J. Hedlund, 2224 Jersey Avenue, 8., St. Louis Park 26, 

Minnesota, U.S.A. 

Dr. F. M. Hemphill, School of Public Health, University of Michigan, 
Ann Arbor, Michigan, U.S.A. 

Mr. Francis C. Henderson, 211 South 4th Street, North Wales, Pennsyl- 
vania, U.S.A. 

Mr. Philip Hermann, Jones and Laughlin Steel Corporation, 3 Gateway 
Center, 11 South Pittsburgh, Pennsylvania, U.S.A. 

Dr. Charles Robert Hicks, Statistical Laboratory, Purdue University, 
Lafayette, Indiana, U.S.A. 

Mr. David Hogben, 301 Lyon Hall, Iowa State College, Ames, Iowa, 
US.A. 

Dr. Henry J. Horn, P. O. Box 333, Falls Church, Virginia, U.S.A. 

Mr. Earl E. Housemann, 1005 Hillwood Avenue, Falls Church, Virginia, 
US.A. 

Dr. Cyril J. Hoyt, 111 Burton Hall, University of Minnesota, Minne- 
apolis 14, Minnesota, U.S.A. 

Dr. Harry M. Hughes, 300 Leonidas Drive, San Antonio 10, Texas, 
U.S.A. 

Dr. John Stuart Hunter, 1123 Stilford Avenue, Plainfield, New Jersey, 
US.A. 

Dr. Ray Hyman, Public and Employee Rel. Res. Service, General Elec- 
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tric Company, 570 Lexington Avenue, New York 22, N. Y., U.S.A. 

Mr. Iver A. Iversen, 420—5th Street, S.E., Minneapolis 14, Minnesota, 
US.A. 

Dr. Marvin A. Iverson, 144-70 C Village Road, Jamaica 33, New York, 
US.A. 

Mr. Seymour Jablon, National Research Council, 2101 Constitution 
Avenue, Washington 25, D. C., U.S.A. 

Dr. Emil H. Jebe, Statistical Laboratory, Iowa State College, Ames, 
Towa, U.S.A. 

Dr. Eugene A. Johnson, School of Public Health, University of Minne- 
sota, Minneapolis, Minnesota, U.S.A. 

Mr. S. K. Katti, c/o Statistical Laboratory, Iowa State College, Ames, 
Towa, U.S.A. 

Miss Kathleen M. Keenan, 1944 Palace Avenue, St. Paul 5, Minnesota, 
US.A. 

Dr. Therese Kelleher, Statistical Standards Division, Agricultural 
Marketing Service, USDA, Washington, D. C., U.S.A. 

Dr. C. James Klett, Box 653, Perry Point, Maryland, U.S.A. 

Dr. Leonard 8. Kogan, 105 E. 22 Street, New York 10, N. Y., U.S.A. 

Mr. Gary F. Krause, Department of Entomology, Kansas State College, 
Manhattan, Kansas, U.S.A. 

Mr. William Kruskal, 1227 East 54 Street, Chicago 15, Illinois, U.S.A. 

Mr. W. F. Kuebler, Jr., 8506 Woodson Drive, Overland Park, Kansas, 
U.S.A. 

Mr. Solomon Kunofsky, N. Y. State Department of Health, 84 Holland 
Avenue, Albany, New York, U.S.A. 

Dr. Morton Kupperman, 2300 North John Marshall Drive, Arlington 5, 
Virginia, U.S.A. 

Mr. Thomas E. Kurtz, 24 East Wheelock Street, Hanover, New Hamp- 
shire, U.S.A. 

Mr. Bertrand A. Landry, 505 King Avenue, Columbus 1, Ohio, U.S.A. 

Mr. William Thomas Lewish, Statistics Laboratory, lowa State College, 
Ames, Iowa, U.S.A. 

Mr. Alfred Lieberman, 10224 Leslie Street, Silver Spring, Maryland, 
U.S.A. 

Mr. George F. Lunger, Valley Highlands, Box 561, Route 1, Savage, 
Minnesota, U.S.A. 

Dr. Samuel B. Lyerly, 1204 Holly Street, N.W., Washington 12, D.C., 
US.A. 

Mr. Albert C. Maas, Manager, Statistical Analysis, Green Giant Com- 
pany, Le Sueur, Minnesota, U.S.A. 
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Mr. Donald G. MacEachern, 327—14th Avenue, 8., Minneapolis, Min- 
nesota, U.S.A. 

Mr. John G. Magistad, 2433 Wesley Road, Raleigh, North Carolina, 
U.S.A. 

Mr. John Mandel,Division 7, Section 5, National Bureau of Standards, 
Washington 25, D. C., U.S.A. 

Dr. Carson McGuire, 308 Sutton Hall, University of Texas, Austin 12, 
Texas, U.S.A. 

Mr. F. E. McVay, Department of Experimental Statistics, North Caro- 
lina State College, Raleigh, North Carolina, U.S.A. 

Mr. Jay Mendelsohn, 150-40 71 Avenue, Flushing 67, New York, U.S.A. 

Mr. Donald L. Meyer, 329 Oak Street, S.E., Minneapolis 14, Minnesota, 
US.A. 

Prof. 8S. Raymond Miles, Life Science Bldg., Purdue University, Lafay- 
ette, Indiana, U.S.A. 

Mr. J. T. Miller, Meredith Publishing Company, Des Moines, Iowa, 
U.S.A. 

Dr. Ralph G. Mitchell, Department of Dairy Husbandry, West Virginia 
University, Morgantown, West Virginia, U.S.A. 

Mr. Jack Nadler, 234 Eagle Rock Avenue, West Orange, New Jersey, 
U.S.A. 

Mr. August Carl Nelson, Jr., 948 Fair Oaks Street, Bethel Park, Penn- 
sylvania, U.S.A. 

Mr. Nicholas Nikitich, 20 Featherbed Lane, New York 52, New York, 
US.A. 

Dr. Warren T. Norman, Department of Psychology, University of 
Michigan, Ann Arbor, Michigan, U.S.A. 

Mr. Jack I. Northam, 1833 Waits, Kalamazoo, Michigan, U.S.A. 

Mr. James A. Norton, Jr., Statistical Laboratory, Purdue University, 
West Lafayette, Indiana, U.S.A. 

Dr. Ellis F. Parmenter, Champion Paper and Fibre Company, Hamil- 
ton, Ohio, U.S.A. 

Dr. Sidney Pell, Medical Division, E. I. DuPont de Nemours and Com- 
pany, Inc., Wilmington 98, Delaware, U.S.A. 

Miss Florence M. Pohley, 4715 Farwell Avenue, Lincolnwood 46, 
Illinois, U.S.A. 

Mr. Aloysius Joseph Polaneczky, 307 Lyster Road, Oreland, Penn- 
sylvania, U.S.A. 

Dr. Lawrence M. Potter, Poultry Science Department, Box U-17, Uni- 
versity of Connecticut, Storrs, Connecticut, U.S.A. 

Mr. George B. Pratt, American Can Company, Barrington, Illinois, 
US.A, 
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Mrs. Winifred Riach, 22155 Ann Arbor Tr., Dearborn, Michigan, U.S.A. 

Mr. Robert Roeloffs, 240 Holmes Street, Belleville 9, New Jersey, U.S.A. 

Dr. Arnold J. Rosenthal, 8 Ford Hill Road, Whippany, New Jersey, 
US.A. 

Mr. William Salkind, 90 LaSalle Street, New York 27, New York, 
US.A. 

Mr. Thomas E. Sedlmayr, 1206 J. University Village, East Lansing, 
Michigan, U.S.A. 

Dr. Stanley H. Seeman, 331 Euclid Avenue, Kenmore 23, New York, 
U.S.A. 

Dr. Maynard W. Shelly II, 843 Holmes Avenue, Deerfield, Illinois, 
US.A. 

Mr. Tetsuo Shimamoto, 1736 Beryl Drive, Pittsburgh 27, Pennsylvania, 
U.S.A. 

Miss Delores Shupenka, 14081 Robson, Detroit 27, Michigan, U.S.A. 

Dr. John Skory, Combustion Engineering, Inc., Windsor, Connecticut 
U.S.A. 

Dr. Harry Smith, Jr., 7351 Shewango Way, Cincinnati 43, Ohio, U.S.A. 

Dr. J. E. Keith Smith, Rm. C-330, Lincoln Laboratory, Lexington 73, 
Massachusetts, U.S.A. 

Mr. Robert Lee Smith, 420 Hillside Avenue, Waverly, Ohio, U.S.A. 

Dr. Ralph G. Stanton, Waterloo College, Waterloo, Ontario, Canada. 

Dr. Theodor D. Sterling, Department of Preventive Medicine, College 
of Medicine, University of Cincinnati, Cincinnati, Ohio, U.S.A. 

Mr. Alexander Sternberg, Sonotone Corporation, P. O. Box 200, Elms- 
ford, N. Y., U.S.A. 

Mr. W. T. Stille, Saunders Road, Lake Forest, Illinois, U.S.A. 

Miss Elizabeth Street, N.Y.U.-Bellevue Medical Center, 550 First 
Avenue, New York 16, N.Y., U.S.A. 

Mr. J. G. Strieby, 616 E. Circle Street, Appleton, Wisconsin, U.S.A. 

Dr. Jacques St-Pierre, Center of Statistics, Math Department, Univer- 
sity of Montreal, Quebec, Canada 

Dr. Mae G. Tarver, 1911 W. Estes, Chicago 26, Illinois, U.S.A. 

Mr. Earl A. Thomas, 27 Winding Way, Malvern, Pennsylvania, U.S.A. 

Mr. Morris Edward Tittle, 3702 Cavitt, Bryan, Texas, U.S.A. 

Mr. Glen F. Vogel, 471 Arlington Village, Arlington 4, Virginia, U.S.A. 

Mr. Peter F. Wade, c/o Aluminum Company of Canada, Ltd., Kings- 
ton, Ontario, Canada 

Dr. Ronald E. Walpole, Roanoke College, Salem, Virginia, U.S.A. 

Mr. Francis R. Watson, 400 E. Northwood, Columbus 1, Ohio, U.S.A. 

Mr. Vernon E. Weckworth, 933 Sheridan Road, Wilmette, Illinois, 
U.S.A. 
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Mr. Irving Weiss, Bell Telephone Laboratories, North Andover, Massa- 
chusetts, U.S.A. 

Mr. Eugene H. Welsand, 3411S. 10th Street, Milwaukee 15, Wisconsin, 
US.A. 

Miss Roberta A. Wilcox, R.F.D., Hope Valley, Rhode Island, U.S.A. 

Mr. Gregory P. Williams, 2725 Stratford, Cincinnati 20, Ohio, U.S.A. 

Dr. Myron J. Willis, 2550 Sharondale Ct., N.E., Atlanta 5, Georgia, 
US.A. 

Dr. R. Lowell Wine, Hollins College, Virginia, U.S.A. 

Dr. James Winer, Department of Psychology, Purdue University, 
Lafayette, Indiana, U.S.A. . 


Mr. Kuo Chuan Woo, Box 5393, State College Station, Raleigh, North 
Carolina, U.S.A. 


Mr. Pao-lo Yu, 3840 Greystone Avenue, New York 63, N. Y., U.S.A. 
German Region 


Dr. Hans Klinger, Herzberger Landstr. 2b, Gottingen, Germany 
Prof. Dr. Eduard von Boguslawski, Ludwigstr. 23, Giessen-Lahn, 
Germany 


Italian Region 


Dr. G. Maffii, c/o Lepetit S.p.a., via R. Lepetit 10, Milano, Italy 

Dr. A. M. Pandolfi, via Strambio 34/16, Milano, Italy 

Dr. G. Wittmer, Istituto Scientifico, Sperimentale per i Tabacchi, via 
Nazionale 66, Rome, Italy 


Netherlands Region 


Dr. W. Lammers, Sterrebosch 1, Utrecht, Netherlands 

Mr. John W. Mayne, Ruychrocklaan 208, The Hague, Netherlands 

Mr. P. Ubbels, Instituut voor de Pluimveeteelt ‘Het Spelderholt,”’ 
Beekbergen, Netherlands 

Prof. Dr. H. R. van der Vaart, Boshuizenkade 11, Leiden, The Nether- 
lands 
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NEWS AND ANNOUNCEMENTS 


Members are invited to transmit to their National or Regional Secre- 
tary (if members at large, to the General Secretary) news of appointments, 
distinctions, or retirements, and announcements of professional interest. 


EDITOR’S ANNOUNCEMENT 


It is with great pleasure that we announce that Dr. David J. Finney 
has agreed to edit the section on Queries and Notes in Biometrics, suc- 
ceeding Professor G. W. Snedecor. The section on Queries and Notes 
under the editorship of Professor Snedecor has long been a feature of 
Biometrics which received many favorable comments from subscribers. 
Dr. Finney plans to continue this section as it has been developed. 
Contributions for the section on Queries and Notes should be sent to 
Dr. Finney, in triplicate, at the Department of Statistics, University of 
Aberdeen, Meston Walk, Old Aberdeen, Scotland. 

A section on book reviews is being considered for Biometrics. A 
committee consisting of Dr. C. I. Bliss, chairman, Dr. H. W. Norton, 
Dr. 8. C. Pearce, and Dr. Luigi Cavalli-Sforza has been appointed to 
define policy and to make recommendations on implementing those 
policies. Comments or suggestions should be sent to Dr. C. I. Bliss, 
Connecticut Agricultural Experiment Station, Post Office Box 1106, 
New Haven 4, Connecticut. 


MATHEMATICS RESEARCH CENTER SYMPOSIUM 


A Symposium on Boundary Problems in Differential Equations, 
with especial reference to recent developments in this field, will be 
held by the Mathematics Research Center, United States Army, at the 
University of Wisconsin, April 20-22, 1959. Invited speakers, about 
twenty in number, will each present a thirty-minute paper. Both ordi- 
nary and partial differential equations will be considered, the emphasis 
to be upon methods that are potentially adapted to computation. Be- 
tween lectures there will be discussion periods and coffee breaks. The 
proceedings of the Symposium will be published. 

Among the speakers, European and American, will be L. Collatz, 
G. Fichera, L. Fox, W. T. Koiter, J. Schréder, I. N. Sneddon, R. Bell- 
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man, G. Birkhoff, H. Bueckner, R. Courant, J. B. Diaz, J. Douglas, 
K. Friedrichs, P. Garabedian, B. A. Troesch, R. Varga, C. Wilcox, 
and D. Young. 

Persons interested in attending the symposium may receive the 
program and other details by writing to R. E. Langer, Director Mathe- 
matics Research Center, U. 8. Army, 1118 West Johnson Street, Madi- 
son 6, Wisconsin. 


STATISTICAL RESEARCH MONOGRAPHS 


The Institute of Mathematical Statistics and the University of 
Chicago have established a series of publications entitled Statistical 
Research Monographs. 

The primary purpose of this series is to provide a medium of publi- 
cation for material of interest to statisticians that is not ordinarily 
provided for by existing media. It will help fill the gap between journal 
articles and textbooks or treatises. Among the kinds of publications 
envisaged are: 


New research results too lengthy for the usual journal article. In 


particular, authors will have ample scope for detailed exposition of 
their findings. 


Research results of interest in both theoretical and applied statistics. 
At present authors of such material frequently find it necessary to 
publish part of their results in a theoretical journal and part in an 
applied journal. 


Expository monographs in particular areas of statistics. 


Discussions of statistical problems and techniques in particular areas 
of application. 


Every attempt will be made to maintain the highest standards of 
scholarship. 

The Editorial Board consists of David Blackwell (University of 
California), William G. Cochran (Harvard University), Henry E. 
Daniels (University of Birmingham), Leo A. Goodman (University of 
Chicago), Wassily Hoeffding (University of North Carolina), Jack C. 
Kiefer (Cornell University), and William H. Kruskal (University of 
Chicago). 

Authors are invited to send manuscripts and correspondence con- 
cerning the series to Leo A. Goodman, Department of Statistics, Uni- 
versity of Chicago, Chicago 37, Illinois. 
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REVIEW OF MATHEMATICAL AND STATISTICAL TABLES 


Mathematical Tables and Other Aids to Computation tries to review 
all material containing substantial mathematical or statistical tables, 
material of substantial general interest pertaining to numerical analysis 
(usually of book length), and material of wide interest pertaining to 
other aids to computation, including automatic computing machinery. 
Authors of published papers and less widely circulated reports may sub- 
mit reprints or copies of their reports for publication to Harry Polachek, 
Editor, Mathematical Tables and Other Aids to Computation, David 
Taylor Model Basin, Washington 7, D. C. 


INDIAN SOCIETY OF AGRICULTURAL STATISTICS 


The Indian Society of Agricultural Statistics has decided to institute 
prizes for the promotion of research in statistics. In addition research 
assistance for promotion of research in statistics is also available. The 
prizes will be awarded for articles of sufficiently high merit published 
in the Journal of the Society. The research assistance is designed to 
encourage promising workers who are not otherwise in a position to 
pursue research due to lack of suitable facilities. Applications and 
additional information may be obtained from the Secretary, Indian 
Society of Agricultural Statistics, c/o Indian Council of Agricultural 
Research Statistical Wing, Library Avenue, New Delhi-12, India. 


SOUTHERN REGIONAL GRADUATE SUMMER SESSION 
IN STATISTICS AT NORTH CAROLINA 
STATE COLLEGE 


The 1959 session of the Southern Regional Graduate Summer Ses- 
sion in Statistics will be held at North Carolina State College, Raleigh, 
from June 8 to July 17, 1959. 

North Carolina State College, Virginia Polytechnic Institute, Uni- 
versity of Florida, and Oklahoma State University have agreed to 
operate a continuing program of graduate summer sessions in statistics 
to be held at each institution in rotation; the program was instituted 
at Virginia Polytechnic Institute in the summer of 1954. 

The 1959 session, like previous sessions under this program, is in- 
tended to serve: 1) teachers of introductory statistical courses who 
want formal training in modern statistics; 2) research and professional 
workers who want intensive instruction in basic statistical concepts 
and modern statistical methodology; 3) professional statisticians who 
wish to keep informed about advanced specialized theory and methods; 
4) prospective candidates for graduate degrees in statistics; and 5) 
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graduate students in other fields who desire supporting work in sta- 
tistics. 

The session will last six weeks and courses will carry three semester 
hours of credit. Not more than two courses may be taken for credit 
at any one session. The summer work in statistics may be applied as 
residence credit at any of the cooperating institutions, as well as certain 
other universities, in partial fulfillment of the requirements for a gradu- 
ate degree. The program may be entered at any session, and consecu- 
tive courses will follow in successive summers so that it would be pos- 
sible for a student to complete the course work in statistics for a Master’s 
degree in three summers. Students must satisfy the remaining re- 
quirements for course work and thesis at the institution where they 
are to be admitted to candidacy. The advanced courses may be accepted 
as part of the Ph. D. program of the participating institutions. 

The courses to be offered in statistics in 1959 at Raleigh are as 
follows: Statistical Methods I and II, Statistical Theory I, II and III, 
Theory of Sampling Applied to Survey Design, Stochastic Processes 
and Their Applications, Methods of Operations Research, and Advanced 
Topics in Statistical Methods. 

A number of courses will, in addition, be available in the Mathe- 
matics Department. Courses to be offered during the June 8-July 17 
period of the Summer Session in Statistics include: Differential Equa- 
tions, Introduction to Determinants and Matrices, Introduction to 
Applied Mathematics, Numerical Analysis, Advanced Calculus I, and 
Boundary Value Problems. 

For students interested in the regular 12-week summer session at 
North Carolina State College, additional Mathematics courses are 
available: Advanced Calculus II and Advanced Differential Equations 
will be offered in the second 6-week session beginning July 20; Complex 
Variable Theory and Vector Spaces and Matrices will be offered over 
the 12 weeks of both sessions. The formal course offerings will be 
supplemented by seminars and special lectures. 

Requests for application blanks for the summer school should be 
addressed to: F. IE. McVay, Department of Experimental Statistics, 
North Carolina State College, Raleigh, North Carolina. 


SUMMER OFFERINGS IN STATISTICS AT 
IOWA STATE COLLEGE 


The Department of Statistics at Iowa State College will offer six 
applied courses in statistical theory and methods in its two 1959 sum- 
mer sessions. These courses are planned primarily for graduate students 
or research workers with limited mathematical backgrounds who wish 
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to use statistical techniques intelligently for application to other fields. 
In addition, a course on special topics in theoretical or applied statistics 
may be studied at the graduate level. Senior staff members will be 
available during most of the summer for consultations on research or 
special problems. 

Students may register for either or both of the six-week summer 
sessions: June 8—July 15 and July 15-August 21. The complete list of 
statistics offerings for the first session is as follows: Stat. 401, “Sta- 
tistical Methods for Research Workers” (at the level of Snedecor’s 
Statistical Methods); Stat. 447, “Statistical Theory for Research 
Workers” (mainly theory of experimental statistics at the level of 
Anderson and Bancroft’s Statistical Theory in Research); Stat. 599, 
‘Special Topics;” and Stat. 699, “Research.” In the second session 
will be offered Stat. 402, a continuation of 401; Stat. 448, a continua- 
tion of 447; two courses in applied methods which are more specialized— 
Stat. 411, “Experimental Designs for Research Workers,” and Stat. 
421, “Survey Designs for Research Workers;” and finally Stat. 599 
and 699. Additional information may be obtained from T. A. Ban- 
croft, department head and Director, Statistical Laboratory, Iowa 
State College, Ames, Iowa, U.S.A. 


PURDUE UNIVERSITY 


Purdue University announces the establishment of the Population 
Genetics Institute to coordinate an expanding research program in 
population genetics. 

The major objective of the Institute will be to investigate the ef- 
fects of various types of gene action on response to selection under 
various mating systems and varying environmental conditions. 

In addition to theoretical studies, problems will be investigated 
experimentally with laboratory organisms such as Drosophila, Tribo- 
lium, and mice. This work is currently being conducted in the Depart- 
ments of Dairy and Poultry Science, the Purdue Statistical and Com- 
puting Laboratory, the North Central States regional poultry breeding 
laboratory, and the pioneering research laboratory for animal genetics, 
Animal Husbandry Research Division, ARS, USDA. 

Facilities for the Institute in Purdue’s new Life Science Building 
will include offices, fully equipped laboratories, and three specially 
designed environment controlled chambers. The facilities of the sta- 
tistical and computing laboratory, including a digital computer, will be 
used for expanding the theoretical or mathematical approach to prob- 
lems of population genetics. 
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Dr. A. E. Bell, professor of Poultry Science, was named chairman 
of the Institute. Others from the Purdue staff named to the Institute 
include Drs. V. L. Anderson, B. B. Bohren, 8. C. King, W. H. Kyle, 
J. H. Martin, T. G. Martin, and H. E. McKean. 
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ARTICLES 
‘Student’ and Small Sample B. L. WEetcu 
On Grouping for Maximum Homogeneity.............. Water D. FisHER 
Significance Tests in Parallel and in Series...................-- I. J. Goop 
Ordinal Measures of H. Krouskan 
Curtailed Sampling for Variables.................... Norman R. GARNER 
On the Studentized Smallest Chi-Square............. K. V. RAMACHANDRAN 


The Estimation of the Parameters of a Linear Regression System Obeying 


Alternative Definitions of the Serial Correlation Coefficient in Short Auto- 


A Stochastic Analysis of the Size Distribution of Firms. ..Inma G. ADELMAN 
Some Analyses of Income-Food Relationships....... Marcuenrite C. Burk 
A Cross-Section Analysis of Non-Business Air Travel 


JouN B. LANSING AND Dwicut M. Bioop 


Approaches to National Output Mcasurement............ Pau B. Smeson 
Notes on Immigration Statistics of the United States..... E. P. Hurcuinson 


BOOK REVIEWS 


AMERICAN STATISTICAL ASSOCIATION 
1757 K Street, N.W., Washington 6, D.C. 


Information on memberships, subscriptions, and back numbers should be 
requested from the Executive Director, American Statistical Association, 
1757 K Street, N.W., Washington 6, D. C., U.S.A. 
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INFORMATION FOR CONTRIBUTORS 


MANUSCRIPTS 


Contributions for Biometrics may be addressed to Dr. Ralph A. Bradley, Depart- 
ment of Statistics, Virginia Polytechnic Institute, Blacksburg, Virginia, U.S.A.; 
authors residing in the following Society Regions can expedite consideration of papers 
by submitting them to the appropriate Associate Editor, namely; BRITISH RE- 
GION: Dr. 8. C. Pearce, East Malling Research Station, East Malling, Maidstone, 
Kent, England; AUSTRALASIAN REGION: Dr. E. A. Cornish, University of 
Adelaide, Adelaide, Australia; FRENCH REGION: Dr. Georges Teissier, Faculté 
des Sciences de Paris, 1 rue V. Cousin, Paris, France. QUERIES, NOTES, and 
related correspondence should be directed to Dr. D. J. Finney, Department of 
Statistics, University of Aberdeen, Meston Walk, Old Aberdeen, Scotland. 

MANUSCRIPTS must be submitted in triplicate, with typescript doublespaced 
throughout. Marginal notes may obviate typographical difficulties presented by 
complicated formulae or tables—authors should not attempt editorial instructions 
or markings for the printer. TABLES should be identified by arabic number and 
by a short descriptive title. ILLUSTRATIONS should also be identified by arabic 
number and by a brief caption. (Captions should not be included in illustrations, 
but should be typewritten collectively on an accompanying sheet.) Originals 
should be approximately 8.5 x 11 in. (21.5 x 28 cm.). The original of each chart, 
diagram, or graph should be executed in black on white drawing paper or board, on 
blue tracing linen, or on coordinate paper ruled in blue only; coordinate lines to be 
reproduced should be ruled in black. For printing, illustrations may be reduced to 
¥ or \ original dimensions. Lines should therefore be of sufficient thickness, and 
decimal points, periods, and stippled dots should be solid black circles large enough 
to reproduce well. Lettering and numerals should be at least 1 mm. high when 
reproduced in a cut 3 in. (7.5 cm.) wide. Photographs should be prints on glossy 
paper with strong contrasts, and if grouped in a plate should be mounted contig- 
uously. All tables and illustrations should be mentioned explicitly in the text. 
REFERENCES (BIBLIOGRAPHIC) should be collectively listed alphabetically 
by author; textual citation by author and year is preferred. 


ABSTRACTS 


Abstracts of papers presented at meetings of the Biometric Society or of its 
regions are printed in Biometrics following such meetings. They should be submitted 
to the person designated to receive them for a particular meeting in exactly the form 
published in Biometrics (except for an Abstract Number), doublespaced on bond 
paper, and in duplicate. Use of formulae requiring display printing is to be avoided. 


Notices, ANNOUNCEMENTS, AND Biometric Society Reports 


International and regional reports and notices should be submitted by the 
appropriate officers of the Society and its Regions in duplicate doublespaced on 
separate sheets exactly as they are to be printed in Biometrics. Other material to 
be printed in News and Announcements should also be submitted doublespaced 
and in duplicate. 


SusTarntnc MEMBERS OF THE Biometric Society 


Abbott Laboratories 
American Cancer Society, Inc. 
Merck, Sharp and Dohme Research Laboratories 
Schering Corporation 

Smith, Kline and French Laboratories 

E. R. Squibb and Sons 

Wallace Laboratories, Division of Carter Products 
Wyeth Institute of Applied Biochemistry 
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BACK ISSUES 


Back issues of Biometrics are available at the following postage-paid 
prices in U.S.A. currency: 

Price per Price per 
Year Volume Number’ Single Number Volume(unbound) 


1945 1 1 to 6 $1.00 $6.00 
1946 2 1 to 6 1.00 6.00 
1947 3 lto4 1.50 5.00 
1948 4 1to4 1.50 5.00 
1949 5 1to4 1.50 5.00 
1950 6 1 to 4 1.50 5.00 
1951 7 1to4 2.00 8.00 
1952 8 1lto4 2.00 8.00 
1953 9 1to4 2.00 8.00 
1954 10 1 to 4 2.00 8.00 
1955 11 1to4 2.00 8.00 
1956 12 1 to4 2.00 8.00 
1957 13 1to4 2.00 8.00 
1958 14 1to4 2.00 8.00 


Reprints of individual articles are not available except to authors at the 
time of printing. Three special issues are among the numbers listed 
above. They are: | 


1947 Volume 3 Number 1 The Analysis of Variance 
1951 Volume 7 Number 1 Components of Variance 
1957 Volume 13 Number 3 The Analysis of Covariance 


Also available are: 
Fishery Reprint Series (Selected reprints from Vol. 5) $1. 
Subject Index (Volumes 1-10) 1 
Proceedings, International Biometric Symposium, 
Campinas, Brazil, 1955. 


Inquiries, non-member subscriptions, and orders for back issues and 
other material listed above should be addressed to: Biomerrics, DEPART- 
MENT OF STATISTICS, VIRGINIA PoLYTECHNIC INSTITUTE, BLACKSBURG, 
U.S.A. 
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